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CROSS-REFERENCES TO RELATED APPLICATIONS 

This application claims priority to USSN 60/263,965, filed January 24, 2001; 
USSN 60/265,928, filed February 2, 2001; USSN 09/829,472 filed April 9, 2001; USSN 
60/282,698, filed April 9, 2001; USSN 60/288,590, filed May 4, 2001; and USSN 
60/294,443, filed May 29, 2001, all of which are incorporated herein by reference in their 
entirety. 

FIELD OF THE INVENTION 

The invention relates to the identification of nucleic acid and protein 
expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
breast cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis and therapy of breast cancer. The invention further relates to methods for 
identifying and using agents and/or targets that inhibit breast cancer. 

BACKGROUND OF THE INVENTION 
Breast cancer is one of the most frequently diagnosed cancers and the second 
leading cause of female cancer death in North America and northern Europe, with lung 
cancer being the leading cause. Lifetime incidence of the disease in the United States is one- 
in-eight, with a l-in-29 lifetime risk of dying from breast cancer. Early detection of breast 
cancer, using mammography, clinical breast examination, and self breast examination, has 
dramatically improved the treatment of the disease, although sensitivity is still major concern, 
as mammographic sensitivity has been estimated at only 60%-90%. Treatment of breast 
cancer consists largely of surgical lumpectomy or mastectomy, radiation therapy, anti- 
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hormone therapy, and/or chemotherapy. Although many breast cancer patients are effectively 
treated, the current therapies can all induce serious side effects which diminish quality of life. 
Deciding on a particular course of treatment is typically based on a variety of prognostic 
parameters and markers (Fitzgibbons et al., 2000, Arch. Pathol. Lab. Med. 124:966-978; 
5 Hamilton and Piccart, 2000, Ann. Oncol. 1 1 :647-663), including genetic predispostion 
markers BRCA-1 and BRCA-2 (Robson, 2000, J. Clin. Oncol. 18:113sup-118sup). 

Imaging of breast cancer for diagnosis has been problematic and limited. In 
p. addition, dissemination of tumor cells (metastases) to locoregional lymph nodes is an 
2 important prognostic factor; five year survival rates drop from 80 percent in patients with no 
LI!0 lymph node metastases to 45 to 50 percent in those patients who do have lymph node 
n 1 metastases. A recent report showed that micrometastases can be detected from lymph nodes 

^ using reverse transcriptase-PCR methods based on the presence of mRNA for 

jp 

% carcinoembryonic antigen, which has previously been shown to be present in the vast 

jsss. 

jT majority of breast cancers but not in normal tissues. Liefers et al., New England J. of Med. 
1J5 339(4):223 (1998). 

Q The identification of novel therapeutic targets and diagnostic markers is 

v essential for improving the current treatment of breast cancer patients. Recent advances in 
molecular medicine have increased the interest in tumor-specific cell surface antigens that 
could serve as targets for various immunotherapeutic or small molecule strategies. Antigens 
20 suitable for immunotherapeutic strategies should be highly expressed in cancer tissues and 
ideally not expressed in normal adult tissues. Expression in tissues that are dispensable for 
life, however, may be tolerated. Examples of such antigens include Her2/neu and the B-cell 
antigen CD20. Humanized monclonal antibodies directed to Her2/neu 
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer 
25 (Ross and Fletcher, 1998, Stem Cells 16:413-428). Similarly, anti-CD20 monoclonal 

antibodies (Rituxin®/rituximab) are used to effectively treat non-Hodgekin's lymphoma 
(Maloney et al., 1997, Blood 90:2188-2195; Leget and Czuczman, 1998, Curr. Opin. Oncol. 
10:548-551). 

Other potential immunotherapeutic targets have been identified for breast 
30 cancer. One such target is polymorphic epithelial mucin (MUC1). MUC1 is a transmembrane 
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protein, present at the apical surface of glandular epithelial cells. It is often overexpressed in 
breast cancer, and typically exhibits an altered glycosylation pattern, resulting in an 
antigenically distinct molecule, and is in early clinical trials as a vaccine target (Gilewski et 
al., 2000, Clin. Cancer Res. 6:1693-1701; Scholl et aL, 2000, J. Immunother. 23:570-580). 
The tumor-expressed protein is often cleaved into the circulation, where it is detectable as the 
tumor marker, CA 15-3 (Bon et al., 1997, Clin. Chem. 43:585-593). However, many patients 
have tumors that express neither HER2 nor MUC-1 ; therefore, it is clear that other targets 
need to be identified to manage localized and metastatic disease. Many other genes have been 
reported to be overexpressed in breast cancer, such as EGFR (Sainsbury et al., 1987, Lancet 
1(8547):1398-1402), c-erbB3 (Naidu et al., 1988, Br. J. Cancer 78:1385-1390), FGFR2 
(Penault-Llorca et al., 1991, Int. J. Cancer 61:170-176), PKW (Preiherr et al., 2000, 
Anticancer Res. 20:2255-2264), MTA1 (Nawa et al., 2000, J. Cell Biochem. 79:202-212), 
breast cancer associated gene 1 (Kurt et al., 2000, Breast Cancer Res. Treat. 59:41-48). 
Although monoclonal antibodies to the protein products of some of these overexpressed 
genes have been reported (for review, see Green et al., 2000, Cancer Treat. Rev. 26:269-286), 
none are currently approved for breast cancer therapy in the US. 

Disclosures of certain genes and ESTs described as being expressed in breast 
cancer are found in international patent applications WO-99/33869, WO-97/25426, WO- 
97/02280 and WO-00/55173, WO-98/45328 and WO-00/22130. Similarly, genes and ESTs 
described as being expressed in breast cancer are disclosed in US Patent Nos. 5,759,776 and 
5,693,522. The utility of such genes is described in each of these publications, and their 
disclosures are incorporated herein in their entirety. 

While industry and academia have identified novel sequences, there has not 
been an equal effort exerted to identify the function of these novel sequences. The 
elucidation of a role for novel proteins and compounds in disease states for identification of 
therapeutic targets and diagnostic markers is essential for improving the current treatment of 
breast cancer patients. Accordingly, provided herein are molecular targets for therapeutic 
intervention in breast and other cancers. Additionally, provided herein are methods that can 
be used in diagnosis and prognosis of breast cancer. Further provided are methods that can 
be used to screen candidate bioactive agents for the ability to modulate breast cancer. 
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SUMMARY OF THE INVENTION 
The present invention therefore provides nucleotide sequences of genes that 
are up- and down-regulated in breast cancer cells. Such genes are useful for diagnostic 
5 purposes, and also as targets for screening for therapeutic compounds that modulate breast 
cancer, such as hormones or antibodies. Other aspects of the invention will become apparent 
to the skilled artisan by the following description of the invention. 
U: In one aspect, the present invention provides a method of detecting a breast 

S cancer-associated transcript in a cell from a patient, the method comprising contacting a 
!I!0 biological sample from the patient with a polynucleotide that selectively hybridizes to a 
W\ sequence at least 80% identical to a sequence as shown in Tables 1-25. 
y In one embodiment, the present invention provides a method of determining 

= the level of a breast cancer associated transcript in a cell from a patient. 

JS5--. 

r: In one embodiment, the present invention provides a method of detecting a 

^5 breast cancer-associated transcript in a cell from a patient, the method comprising contacting 
Q a biological sample from the patient with a polynucleotide that selectively hybridizes to a 
n>] sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at 
least 95% identical to a sequence as shown in Tables 1-25. 
20 In one embodiment, the biological sample is a tissue sample. In another 

embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent 

label. 

In one embodiment, the polynucleotide is immobilized on a solid surface. 
25 In one embodiment, the patient is undergoing a therapeutic regimen to treat 

breast cancer. In another embodiment, the patient is suspected of having metastatic breast 
cancer. 

In one embodiment, the patient is a human. 

In one embodiment, the breast cancer associated transcript is mRNA. 
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In one embodiment, the method further comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of breast cancer, the method comprising the steps of: (i) 
providing a biological sample from a patient undergoing the therapeutic treatment; and (ii) 
determining the level of a breast cancer-associated transcript in the biological sample by 
contacting the biological sample with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25, thereby monitoring 
the efficacy of the therapy. In a further embodiment, the patient has metastatic breast cancer. 
In a further embodiment, the patient has a drug resistant form of breast cancer. 

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the breast cancer-associated transcript to a level of the breast cancer-associated 
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. 

Additionally, provided herein is a method of evaluating the effect of a 
candidate breast cancer drug comprising administering the drug to a patient and removing a 
cell sample from the patient. The expression profile of the cell is then determined. This 
method may further comprise comparing the expression profile to an expression profile of a 
healthy individual. In a preferred embodiment, said expression profile includes a gene of 
Tables 1-25. 

In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-25. 

In one embodiment, an expression vector or cell comprises the isolated nucleic 

acid. 

In one aspect, the present invention provides an isolated polypeptide which is 
encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-25. 

In another aspect, the present invention provides an antibody that specifically 
binds to an isolated polypeptide which is encoded by a nucleic acid molecule having 
polynucleotide sequence as shown in Tables 1-25. 
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In one embodiment, the antibody is conjugated to an effector component, e.g., 
a fluorescent label, a radioisotope or a cytotoxic chemical. 

In one embodiment, the antibody is an antibody fragment. In another 
embodiment, the antibody is humanized. 

In one aspect, the present invention provides a method of detecting a breast 
cancer cell in a biological sample from a patient, the method comprising contacting the 
biological sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting 
antibodies specific to breast cancer in a patient, the method comprising contacting a 
biological sample from the patient with a polypeptide encoded by a nucleic acid comprising a 
sequence from Tables 1-25. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates a breast cancer-associated polypeptide, the method comprising the 
steps of: (i) contacting the compound with a breast cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1-25; and (ii) determining the functional 
effect of the compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic 
effect, or a chemical effect. 

In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 
cell membrane. In another embodiment, the polypeptide is recombinant. 

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting 
proliferation of a breast cancer-associated cell to treat breast cancer in a patient, the method 
comprising the step of administering to the subject a therapeutically effective amount of a 
compound identified as described herein. 

In one embodiment, the compound is an antibody. 
In another aspect, the present invention provides a drug screening assay 
comprising the steps of: (i) administering a test compound to a mammal having breast cancer 
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or to a cell sample isolated therefrom; (ii) comparing the level of gene expression of a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-25 in a treated cell or mammal with the level of gene expression of the 
polynucleotide in a control cell sample or mammal, wherein a test compound that modulates 
5 the level of expression of the polynucleotide is a candidate for the treatment of breast cancer. 

In one embodiment, the control is a mammal with breast cancer or a cell 
sample therefrom that has not been treated with the test compound. In another embodiment, 
the control is a normal cell or mammal, 
fa In one embodiment, the test compound is administered in varying amounts or 

[|j 0 concentrations. In another embodiment, the test compound is administered for varying time 
SR periods. In another embodiment, the comparison can occur after addition or removal of the 
SI drug candidate. 

ssz-- . 

u! In one embodiment, the levels of a plurality of polynucleotides that selectively 

P hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1-25 are 
Hi 5 individually compared to their respective levels in a control cell sample or mammal. In a 
£ preferred embodiment the plurality of polynucleotides is from three to ten. 
fU In another aspect, the present invention provides a method for treating a 

mammal having breast cancer comprising administering a compound identified by the assay 

described herein. 

20 In another aspect, the present invention provides a pharmaceutical 

composition for treating a mammal having breast cancer, the composition comprising a 
compound identified by the assay described herein and a physiologically acceptable 
excipient. 

In one aspect, the present invention provides a method of screening drug 
25 candidates by providing a cell expressing a gene that is up- and down-regulated as in a breast 
cancer. In one embodiment, a gene is selected from Tables 1-25. The method further 
includes adding a drug candidate to the cell and determining the effect of the drug candidate 
on the expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes 
30 comparing the level of expression in the absence of the drug candidate to the level -of 

1 
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expression in the presence of the drug candidate, wherein the concentration of the drug 
candidate can vary when present, and wherein the comparison can occur after addition or 
removal of the drug candidate. In a preferred embodiment, the cell expresses at least two 
expression profile genes. The profile genes may show an increase or decrease. 

Also provided is a method of evaluating the effect of a candidate breast cancer 
drug comprising administering the drug to a transgenic animal expressing or over-expressing 
the breast cancer modulatory protein, or an animal lacking the breast cancer modulatory 
protein, for example as a result of a gene knockout. 

Moreover, provided herein is a biochip comprising one or more nucleic acid 
segments of Tables 1-25, wherein the biochip comprises fewer than 1000 nucleic acid probes. 
Preferably, at least two nucleic acid segments are included. More preferably, at least three 
nucleic acid segments are included. 

Furthermore, a method of diagnosing a disorder associated with breast cancer 
is provided. The method comprises determining the expression of a gene of Tables 1-25, 
preferably a gene of Table 25, in a first tissue type of a first individual, and comparing the 
distribution to the expression of the gene from a second normal tissue type from the first 
individual or a second unaffected individual. A difference in the expression indicates that the 
first individual has a disorder associated with breast cancer. 

In a further embodiment, the biochip also includes a polynucleotide sequence 
of a gene that is not up- and down-regulated in breast cancer. 

In one embodiment a method for screening for a bioactive agent capable of 
interfering with the binding of a breast cancer modulating protein (breast cancer modulatory 
protein) or a fragment thereof and an antibody which binds to said breast cancer modulatory 
protein or fragment thereof. In a preferred embodiment, the method comprises combining a 
breast cancer modulatory protein or fragment thereof, a candidate bioactive agent and an 
antibody which binds to said breast cancer modulatory protein or fragment thereof. The 
method further includes determining the binding of said breast cancer modulatory protein or 
fragment thereof and said antibody. Wherein there is a change in binding, an agent is 
identified as an interfering agent. The interfering agent can be an agonist or an antagonist. 
Preferably, the agent inhibits breast cancer. 

8 
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Also provided herein are methods of eliciting an immune response in an 
individual. In one embodiment a method provided herein comprises administering to an 
individual a composition comprising a breast cancer modulating protein, or a fragment 
thereof. In another embodiment, the protein is encoded by a nucleic acid selected from those 
5 of Tables 1-25. 

Further provided herein are compositions capable of eliciting an immune 
response in an individual. In one embodiment, a composition provided herein comprises a 
breast cancer modulating protein, preferably encoded by a nucleic acid of Tables 1-25, more 
P preferably of Table 25, or a fragment thereof, and a pharmaceutically acceptable carrier. In 
til 0 another embodiment, said composition comprises a nucleic acid comprising a sequence 

encoding a breast cancer modulating protein, preferably selected from the nucleic acids of 
Si Tables 1-25, and a pharmaceutically acceptable carrier. 

y Also provided are methods of neutralizing the effect of a breast cancer protein, 

P or a fragment thereof, comprising contacting an agent specific for said protein with said 
fU 5 protein in an amount sufficient to effect neutralization. In another embodiment, the protein is 
% encoded by a nucleic acid selected from those of Tables 1-25. 

fU In another aspect of the invention, a method of treating an individual for breast 

cancer is provided. In one embodiment, the method comprises administering to said 
individual an inhibitor of a breast cancer modulating protein. In another embodiment, the 
20 method comprises administering to a patient having breast cancer an antibody to a breast 

cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can 
be a cytotoxic agent or a radioisotope. 

DETAILED DESCRIPTION OF THE INVENTION 
25 In accordance with the objects outlined above, the present invention provides 

novel methods for diagnosis and prognosis evaluation for breast cancer (PC), including 
metastatic breast cancer, as well as methods for screening for compositions which modulate 
breast cancer. Also provided are methods for treating breast cancer. 

Tables 1-24B provide unigene cluster identification numbers for the 
30 nucleotide sequence of genes that exhibit increased or decreased expression in breast cancer 

9 
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samples. Tables 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 18, 19, 20, 21, and 22 list those genes that are 
up-regulated in breast cancer cells. Table 14 lists those genes that are highly upregulated in 
breast cancer cells. Table 1, 2, 3, 15, and 23 list genes that are down-regulated in breast 
cancer cells and Table 16, lists genes that are highly down-regulated in breast cancer genes. 
5 The Tables also provide an exemplar accession number that provides a nucleotide sequence 
that is part of the unigene cluster. 

Definitions 

O The term "breast cancer protein" or "breast cancer polynucleotide" or "breast 

: 

(So cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants, 
S- alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence that has 
Si greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, 
r preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide 
r: sequence identity, preferably over a region of over a region of at least about 25, 50, 100, 200, 
til 5 500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of 
K Tables 1-25: (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen 
^ comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a 
gene of Tables 1-25, and conservatively modified variants thereof; (3) specifically hybridize 
under stringent hybridization conditions to a nucleic acid sequence, or the complement 
20 thereof of Tables 1 -25 and conservatively modified variants thereof or (4) have an amino acid 
sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%, 
80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater 
amino sequence identity, preferably over a region of over a region of at least about 25, 50, 
100, 200, 500, 1000, or more amino acid, to an amino acid sequence encoded by a nucleotide 
25 sequence of or associated with a gene of Tables 1-25. A polynucleotide or polypeptide 
sequence is typically from a mammal including, but not limited to, primate, e.g., human; 
rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A "breast cancer 
polypeptide" and a "breast cancer polynucleotide," include both naturally occurring or 
recombinant forms. 
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A "full length" breast cancer protein or nucleic acid refers to a breast cancer 
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements 
normally contained in one or more naturally occurring, wild type breast cancer 
polynucleotide or polypeptide sequences. The "full length" may be prior to, or after, various 
5 stages of post-translation processing or splicing, including alternative splicing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g., of a breast cancer protein, polynucleotide or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 
H humans, or rodents, e.g., mice, and rats. Biological samples may also include sections of 
EjO tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 
jj=; blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. Biological samples also 
W include explants and primary and/or transformed cell cultures derived from patient tissues. A 
Q biological sample is typically obtained from a eukaryotic organism, most preferably a 
L mammal such as a primate e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea 

H 5 pig, rat, mouse; rabbit; or a bird; reptile; or fish. 

fti 

J:; "Providing a biological sample" means to obtain a biological sample for use m 

| ! ; methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 
20 methods of the invention in vivo. Archival tissues, having treatment or outcome history, will 
be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 
25 same (i.e., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91 %, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and 
aligned for maximum correspondence over a comparison window or designated region) as 
measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default 
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI 
30 web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to 
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be "substantially identical." This definition also refers to, or may be applied to, the 
compliment of a test sequence. The definition also includes sequences that have deletions 
and/or additions, as well as those that have substitutions, as well as naturally occurring, e.g., 
polymorphic or allelic variants, and man-made variants. As described below, the preferred 
5 algorithms can account for gaps and the like. Preferably, identity exists over a region that is 
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is 
50-100 amino acids or nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 
0 and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 
1 5 A "comparison window", as used herein, includes reference to a segment of 

one of the number of contiguous positions selected from the group consisting typically of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which 
a sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
20 for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. 
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. 
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat 'I. 
Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms 
25 (GAP, BESTFIT, FAST A, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 
visual inspection (see, e.g., Current Protocols in Molecular Biology (Ausubel et at, eds. 
1995 supplement)). 

Preferred examples of algorithms that are suitable for determining percent 
30 sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, 

12 
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which are described in Altschul et al, Nuc. Acids Res. 25:3389-3402 (1977) and Altschul et 
al, J. Mol Biol. 215:403-410 (1990). BLAST and BLAST 2.0 are used, with the parameters 
described herein, to determine percent sequence identity for the nucleic acids and proteins of 
the invention. Software for performing BLAST analyses is publicly available through the 
5 National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This 

algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either match or satisfy some positive-valued 
} _. threshold score T when aligned with a word of the same length in a database sequence. T is 
O referred to as the neighborhood word score threshold (Altschul et al, supra). These initial 
1 1;0 neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
if ! them. The word hits are extended in both directions along each sequence for as far as the 
Hi cumulative alignment score can be increased. Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 

IK, 

}= ! always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 

srs?.: 

115 sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 
5! hits in each direction are halted when: the cumulative alignment score falls off by the 

quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
20 sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 
uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1989)) alignments (B) of 
25 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, Proc. Nat 'I. Acad. Sci. USA 90:5873- 
5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest 
sum probability (P(N))> which provides an indication of the probability by which a match 
30 between two nucleotide or amino acid sequences would occur by chance. For example, a 
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nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001 . Log values may 
be large negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc. 

An indication that two nucleic acid sequences or polypeptides are substantially 
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 
reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another 
indication that two nucleic acid sequences are substantially identical is that the two molecules 
or their complements hybridize to each other under stringent conditions, as described below. 
Yet another indication that two nucleic acid sequences are substantially identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 
expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like {see, e.g., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

The terms "isolated," "purified," or "biologically pure" refer to material that is 
substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 
preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and 
most preferably at least 99% pure. "Purify" or "purification" in other embodiments means 
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removing at least one contaminant from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogenous, e.g., 100% pure. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those 
containing modified residues, and non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally 
occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, e.g., an a carbon that is 
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 
modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic 
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that functions similarly to a naturally occurring amino acid. 

Amino acids may be referred to herein by either their commonly known three 
letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic 
acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 
most proteins. For instance, the codons GCA, GCC, GCG and GCU all encode the-amino 
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acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can 
be altered to another of the corresponding codons described without altering the encoded 
polypeptide. Such nucleic acid variations are "silent variations," which are one species of 
conservatively modified variations. Every nucleic acid sequence herein which encodes a 
5 polypeptide also describes silent variations of the nucleic acid. One of skill will recognize 
that in certain contexts each codon in a nucleic acid (except AUG, which is ordinarily the 
only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can 
be modified to yield a functionally identical molecule. Accordingly, often silent variations of 
p a nucleic acid which encodes a polypeptide is implicit in a described sequence with respect to 
rIO the expression product, but not with respect to actual probe sequences. 
w As to amino acid sequences, one of skill will recognize that individual 

%j substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
^ sequence which alters, adds or deletes a single amino acid or a small percentage of ammo 
t ! acids in the encoded sequence is a "conservatively modified variant" where the alteration 
35 results in the substitution of an amino acid with a chemically similar amino acid. 
J ; Conservative substitution tables providing functionally similar amino acids are well known in 
ft! the art. Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies homologs, and alleles of the invention. typically 
conservative substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), 
20 Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) 

Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), 
Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, 
e.g., Creighton, Proteins (1984)). 

Macromolecular structures such as polypeptide structures can be described in 
25 terms of various levels of organization. For a general discussion of this organization, see, 
e.g., Alberts et al, Molecular Biology of the Cell (3 rd ed., 1994) and Cantor & Schimmel, 
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980). 
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 
30 structures are commonly known as domains. Domains are portions of a polypeptide that 
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often form a compact unit of the polypeptide and are typically 25 to approximately 500 
amino acids long. Typical domains are made up of sections of lesser organization such as 
stretches of p-sheet and a-helices. "Tertiary structure" refers to the complete three 
dimensional structure of a polypeptide monomer. "Quaternary structure" refers to the three 
5 dimensional structure formed, usually by the noncovalent association of independent tertiary 
units. Anisotropic terms are also known as energy terms. 

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical 
L = equivalents used herein means at least two nucleotides covalently linked together. 

0 Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more 
IIP nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and 

Jfi polynucleotides are a polymers of any length, including longer lengths, e.g., 200, 300, 500, 

1000, 2000, 3000, 5000, 7000, 10,000, etc. A nucleic acid of the present invention will 
r generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are 
b! included that may have alternate backbones, comprising, e.g., phosphoramidate, 
ri5 phosphorothioate, phosphorodithioate, or O-methylphophoroamidite linkages (see Eckstein, 
p Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and 

1 y peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with 

positive backbones; non-ionic backbones, and non-ribose backbones, including those 
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC 

20 Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghui & 

Cook, eds.. Nucleic acids containing one or more carbocyclic sugars are also included within 
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done 
for a variety of reasons, e.g. to increase the stability and half-life of such molecules in 
physiological environments or as probes on a biochip. Mixtures of naturally occurring 

25 nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid 
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. 

A variety of references disclose such nucleic acid analogs, including, for 
example, phosphoramidate (Beaucage et al., Tetrahedron 49(10):1925 (1993) and references 
therein; Letsinger, J. Org. Chem. 35:3800 (1970); Sprinzl et al, Eur. J. Biochem. 81:579 

30 (1977); Letsinger et al., Nucl. Acids Res. 14:3487 (1986); Sawai et al, Chem. Lett. 805 
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(1984), Letsinger et al., J. Am. Chem. Soc. 1 10:4470 (1988); and Pauwels et aL, Chemica 
Scripta 26:141 91986)), phosphorothioate (Mag et al., Nucleic Acids Res. 19:1437 (1991); 
and U.S. Patent No. 5,644,048), phosphorodithioate (Briu et al., J. Am. Chem. Soc. 11 1:2321 
(1989), O-methylphophoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: 
5 A Practical Approach, Oxford University Press), and peptide nucleic acid backbones and 

linkages (see Egholm, J. Am. Chem. Soc. 1 14:1895 (1992); Meier et al., Chem. Int. Ed. Engl. 
31:1008 (1992); Nielsen, Nature, 365:566 (1993); Carlsson et al., Nature 380:207 (1996), all 
of which are incorporated by reference). Other analog nucleic acids include those with 
P positive backbones (Denpcy et al., Proc. Natl. Acad. Sci. USA 92:6097 (1995); non-ionic 
So backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863; 
m Kiedrowshi et al., Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger et al., J. Am. 
PI Chem. Soc. 1 10:4470 (1988); Letsinger et al, Nucleoside & Nucleotide 13: 1597 (1994); 
O Chapters 2 and 3, ASC Symposium Series 580, "Carbohydrate Modifications in Antisense 
O Research", Ed. Y.S. Sanghui and P. Dan Cook; Mesmaeker et al., Bioorganic & Medicinal 
Ct5 Chem. Lett. 4:395 (1994); Jeffs et al., J. Biomolecular NMR 34:17 (1994); Tetrahedron Lett. 

37:743 (1996)) and non-ribose backbones, including those described in U.S. Patent Nos. 
fjj 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate 
Modifications in Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids 
containing one or more carbocyclic sugars are also included within one definition of nucleic 
20 acids (see Jenkins et al., Chem. Soc. Rev. (1995) pp 169-176). Several nucleic acid analogs 
are described in Rawls, C & E News June 2, 1997 page 35. All of these references are hereby 
expressly incorporated by reference. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
25 contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics. PNAs have larger changes in the melting temperature (T m ) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly, 
30 due to their non-ionic nature, hybridization of the bases attached to these backbones is 
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relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular 

enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or 

contain portions of both double stranded or single stranded sequence. As will be appreciated 
5 by those in the art, the depiction of a single strand also defines the sequence of the 

complementary strand; thus the sequences described herein also provide the complement of 

the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
u where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and 
D combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 
5o xanthine hypoxanthine, isocytosine, isoguanine, etc. "Transcript' ' typically refers to a 
}} = i naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term 

^ "nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 

P 

r nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non- 
h 1 naturally occurring analog structures. Thus, e.g. the individual units of a peptide nucleic 
Ri5 acid, each containing a base, are referred to herein as a nucleoside, 
ft A "label" or a "detectable moiety" is a composition detectable by 

!V spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
means. For example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 
20 or other entities which can be made detectable, e.g., by incorporating a radiolabel into the 

peptide or used to detect antibodies specifically reactive with the peptide. The labels may be 
incorporated into the breast cancer nucleic acids, proteins and antibodies at any position. 
Any method known in the art for conjugating the antibody to the label may be employed, 
including those methods described by Hunter et al., Nature . 144:945 (1962); David et al., 
25 Biochemistry. 13:1014 (1974); Pain et al., J. Tmmunol. Meth. . 40:219 (1981); and Nygren, J. 
Histochem. and Cvtochem.. 30:407 (1982). 

An "effector" or "effector moiety" or "effector component" is a molecule that 
is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
30 The "effector" can be a variety of molecules including, e.g., detection moieties including 
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radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope 
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a 
radioisotope emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 
using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic acid capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually 
through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 
may be joined by a linkage other than a phosphodiester bond, so long as it does not 
functionally interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in 
which the constituent bases are joined by peptide bonds rather than phosphodiester linkages. 
It will be understood by one of skill in the art that probes may bind target sequences lacking 
complete complementarity with the probe sequence depending upon the stringency of the 
hybridization conditions. The probes are preferably directly labeled as with isotopes, 
chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a 
streptavidin complex may later bind. By assaying for the presence or absence of the probe, 
one can detect the presence or absence of the select sequence or subsequence. Diagnosis or 
prognosis may be based at the genomic level, or at the level of RNA or protein expression. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic 
acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 
modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., 
recombinant cells express genes that are not found within the native (non-recombinant) form 
of the cell or express native genes that are otherwise abnormally expressed, under expressed 
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or not expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 
polymerases and endonucleases, in a form not normally found in nature. In this manner, 
operably linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear 
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
recombinantly, although subsequently replicated non-recombinantly, are still considered 
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein 
made using recombinant techniques, i.e., through the expression of a recombinant nucleic 
acid as depicted above. 

The term "heterologous" when used with reference to portions of a nucleic 
acid indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein will often refer to two 
or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic 
acid sequences near the start site of transcription, such as, in the case of a polymerase II type 
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 
active under environmental or developmental regulation. The term "operably linked" refers 
to a functional linkage between a nucleic acid expression control sequence (such asha 
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promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 
5 synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
L , transcribed operably linked to a promoter. 

O The phrase "selectively (or specifically) hybridizes to" refers to the binding, 

y|0 duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
W- stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
Si total cellular or library DNA or RNA). 

I" The phrase "stringent hybridization conditions" refers to conditions under 

which a probe will hybridize to its target subsequence, typically in a complex mixture of 
1115 nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and 
K will be different in different circumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 
Tijssen, Techniques in Biochemistry and Molecular Biology— Hybridization with Nucleic 
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays" 
20 (1993). Generally, stringent conditions are selected to be about 5-10°C lower than the 

thermal melting point (T m ) for the specific sequence at a defined ionic strength pH. The T m is 
the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% 
of the probes complementary to the target hybridize to the target sequence at equilibrium (as 
the target sequences are present in excess, at T m , 50% of the probes are occupied at 
25 equilibrium). Stringent conditions will be those in which the salt concentration is less than 
about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other 
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 
50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing agents such as 
30 formamide. For selective or specific hybridization, a positive signal is at least two times 
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background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, 
incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and 
0.1% SDS at 65°C. For PCR, a temperature of about 36°C is typical for low stringency 
5 amplification, although annealing temperatures may vary between about 32°C and 48°C 

depending on primer length. For high stringency PCR amplification, a temperature of about 
62°C is typical, although high stringency annealing temperatures can range from about 50°C 

M to about 65°C, depending on the primer length and specificity. Typical cycle conditions for 

ft 

p both high and low stringency amplifications include a denaturation phase of 90°C - 95°C for 
j}io 30 sec - 2 min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about 
W 72°C for 1 - 2 min. Protocols and guidelines for low and high stringency amplification 
Cli reactions are provided, e.g., in Innis et al. (1990) PCR Protocols, A Guide to Methods and 
%, Applications, Academic Press, Inc. N.Y.). 

M Nucleic acids that do not hybridize to each other under stringent conditions are 

111 5 still substantially identical if the polypeptides which they encode are substantially identical. 
5 This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon 

degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
20 1% SDS at 37°C, and a wash in IX SSC at 45°C. A positive hybridization is at least twice 
background. Those of ordinary skill will readily recognize that alternative hybridization and 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidelines for determining hybridization parameters are provided in numerous reference, e.g., 
and Current Protocols in Molecular Biology, ed. Ausubel, et al 
25 The phrase "functional effects" in the context of assays for testing compounds 

that modulate activity of a breast cancer protein includes the determination of a parameter 
that is indirectly or directly under the influence of the breast cancer protein or nucleic acid, 
e.g., a functional, physical, or chemical effect, such as the ability to decrease breast cancer. It 
includes ligand binding activity; cell growth on soft agar; anchorage dependence; contact 
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inhibition and density limitation of growth; cellular proliferation; cellular transformation; 
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel; 
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing 
metastasis, and other characteristics of breast cancer cells. "Functional effects" include in 
5 vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of a breast 
L ,. cancer protein sequence, e.g., functional, enzymatic, physical and chemical effects. Such 
C functional effects can be measured by any means known to those skilled in the art, e.g., 
§10 changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index), 
Wi hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the breast cancer protein; 
7 measuring binding activity or binding assays, e.g. binding to antibodies or other ligands, and 
h measuring cellular proliferation. Determination of the functional effect of a compound on 
Hi 5 breast cancer can also be performed using breast cancer assays known to those of skill in the 
pi art such as an in vitro assays, e.g., cell growth on soft agar; anchorage dependence; contact 
^ inhibition and density limitation of growth; cellular proliferation; cellular transformation; 

growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel; 
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing 
20 metastasis, and other characteristics of breast cancer cells. The functional effects can be 

evaluated by many means known to those skilled in the art, e.g., microscopy for quantitative 
or qualitative measures of alterations in morphological features, measurement of changes in 
RNA or protein levels for breast cancer-associated sequences, measurement of RNA stability, 
identification of downstream or reporter gene expression (CAT, luciferase, (i-gal, GFP and 
25 the like), e.g., via chemiluminescence, fluorescence, colorimetric reactions, antibody binding, 
inducible markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of breast cancer polynucleotide 
and polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules 
or compounds identified using in vitro and in vivo assays of breast cancer polynucleotide and 
30 polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block 
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activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the 
activity or expression of breast cancer proteins, q.g., antagonists. Antisense nucleic acids 
may seem to inhibit expression and subsequent function of the protein. "Activators" are 
compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or 
up regulate breast cancer protein activity. Inhibitors, activators, or modulators also include 
genetically modified versions of breast cancer proteins, e.g., versions with altered activity, as 
well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small 
chemical molecules and the like. Such assays for inhibitors and activators include, e.g., 
expressing the breast cancer protein in vitro, in cells, or cell membranes, applying putative 
modulator compounds, and then determining the functional effects on activity, as described 
above. Activators and inhibitors of breast cancer can also be identified by incubating breast 
cancer cells with the test compound and determining increases or decreases in the expression 
of 1 or more breast cancer proteins, e.g., 1,2,3,4, 5, 10, 15, 20, 25, 30, 40, 50 or more breast 
cancer proteins, such as breast cancer proteins encoded by the sequences set out in Tables 1- 
25. 

Samples or assays comprising breast cancer proteins that are treated with a 
potential activator, inhibitor, or modulator are compared to control samples without the 
inhibitor, activator, or modulator to examine the extent of inhibition. Control samples 
(untreated with inhibitors) are assigned a relative protein activity value of 100%. Inhibition 
of a polypeptide is achieved when the activity value relative to the control is about 80%, 
preferably 50%, more preferably 25-0%. Activation of a breast cancer polypeptide is 
achieved when the activity value relative to the control (untreated with activators) is 1 10%, 
more preferably 150%, more preferably 200-500% (i.e., two to five fold higher relative to the 
control), more preferably 1000-3000% higher. 

The phrase "changes in cell growth" refers to any change in cell growth and 
proliferation characteristics in vitro or in vivo, such as formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density 
limitation of growth, loss of growth factor or serum requirements, changes in cell 
morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 
ability to form or suppress tumors when injected into suitable animal hosts, and/or - 
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immortalization of the cell. See, e.g., Freshney, Culture of Animal Cells a Manual of Basic 
Technique pp. 231-241 (3 rd ed. 1994). 

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 
"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers 
5 to spontaneous or induced phenotypic changes that do not necessarily involve the uptake of 
new genetic material. Although transformation can arise from infection with a transforming 
virus and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 
u . spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
C - Transformation is associated with phenotypic changes, such as immortalization of cells, 
01 0 aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney, 
i}-; Culture of Animal Cells a Manual of Basic Technique (3 rd ed. 1994)). 
SI "Antibody" refers to a polypeptide comprising a framework region from an 

immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
F? The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
M 5 epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
fi genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
^ gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul, 
20 Fundamental Immunology. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 100 to 1 10 or more amino acids primarily 
25 responsible for antigen recognition. The terms variable light chain (V L ) and variable heavy 
chain (V H ) refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 
characterized fragments produced by digestion with various peptidases. Thus, e.g., pepsin 
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)' 2) a 
30 dimer of Fab which itself is a light chain joined to V H -C H 1 by a disulfide bond. The F(ab)' 2 
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may be reduced under mild conditions to break the disulfide linkage in the hinge region, 
thereby converting the F(ab)' 2 dimer into an Fab' monomer. The Fab' monomer is 
essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed. 
1993). While various antibody fragments are defined in terms of the digestion of an intact 
5 antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used 
herein, also includes antibody fragments either produced by the modification of whole 
antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single 

6 chain Fv) or those identified using phage display libraries (see, e.g., McCafferty etal, Nature 
BO 348:552-554 (1990)) 

K For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 

SI antibodies, many technique known in the art can be used (see, e.g., Kohler & Milstein, 

■ass. 

7 Nature 256:495-497 (1975); Kozbor et al, Immunology Today 4:72 (1983); Cole et al, pp. 
O 77_ 96 i n Monoclonal Antibodies and Cancer Therapy (1985); Coligan, Current Protocols in 
RJ15 Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual (1988); and Goding, 
f i Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). Techniques for the 

W production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to produce 

antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such 
as other mammals, may be used to express humanized antibodies. Alternatively, phage 
20 display technology can be used to identify antibodies and heteromeric Fab fragments that 
specifically bind to selected antigens (see, e.g., McCafferty et al., Nature 348:552-554 
(1990); Marks et al., Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site 
25 (variable region) is linked to a constant region of a different or altered class, effector function 
and/or species, or an entirely different molecule which confers new properties to the chimeric 
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

30 
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Identification of breast cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the 
5 sample; while two states may have any particular gene similarly expressed, the evaluation of 
a number of genes simultaneously allows the generation of a gene expression profile that is 
characteristic of the state of the cell. That is, normal tissue (e.g., normal breast or other 
tissue) may be distinguished from cancerous or metastatic cancerous tissue of the breast, or 
breast cancer tissue or metastatic breast cancerous tissue can be compared with tissue 
V 10 samples of breast and other tissues from surviving cancer patients. By comparing expression 
py profiles of tissue in known different breast cancer states, information regarding which genes 
? ~: are important (including both up- and down-regulation of genes) in each of these states is 
* obtained. 

M< The identification of sequences that are differentially expressed in breast 

5 J;l 5 cancer versus non-breast cancer tissue allows the use of this information in a number of ways. 
For example, a particular treatment regime may be evaluated: does a chemotherapeutic drug 
act to down-regulate breast cancer, and thus tumor growth or recurrence, in a particular 
patient. Similarly, diagnosis and treatment outcomes may be done or confirmed by 
comparing patient samples with the known expression profiles. Metastatic tissue can also be 
20 analyzed to determine the stage of breast cancer in the tissue. Furthermore, these gene 

expression profiles (or individual genes) allow screening of drug candidates with an eye to 
mimicking or altering a particular expression profile; e.g., screening can be done for drugs 
that suppress the breast cancer expression profile. This may be done by making biochips 
comprising sets of the important breast cancer genes, which can then be used in these screens. 
25 These methods can also be done on the protein basis; that is, protein expression levels of the 
breast cancer proteins can be evaluated for diagnostic purposes or to screen candidate agents. 
In addition, the breast cancer nucleic acid sequences can be administered for gene therapy 
purposes, including the administration of antisense nucleic acids, or the breast cancer proteins 
(including antibodies and other modulators thereof) administered as therapeutic drugs. 
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Thus the present invention provides nucleic acid and protein sequences that 
are differentially expressed in breast cancer, herein termed "breast cancer sequences." As 
outlined below, breast cancer sequences include those that are up-regulated (i.e., expressed at 
a higher level) in breast cancer, as well as those that are down-regulated (i.e., expressed at a 
5 lower level). In a preferred embodiment, the breast cancer sequences are from humans; 
however, as will be appreciated by those in the art, breast cancer sequences from other 
organisms may be useful in animal models of disease and drug evaluation; thus, other breast 

y, cancer sequences are provided, from vertebrates, including mammals, including rodents (rats, 
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, 

ff 10 horses, etc.) and pets, e.g., (dogs, cats, etc.). Breast cancer sequences from other organisms 

ffj may be obtained using the techniques outlined below. 

y Breast cancer sequences can include both nucleic acid and amino acid 

7 sequences. As will be appreciated by those in the art and is more fully outlined below, breast 

S cancer nucleic acid sequences are useful in a variety of applications, including diagnostic 

IVl 5 applications, which will detect naturally occurring nucleic acids, as well as screening 

p applications; e.g., biocbips comprising nucleic acid probes or PCR microtiter plates with 

1 y selected probes to the breast cancer sequences can be generated. 

A breast cancer sequence can be initially identified by substantial nucleic acid 

and/or amino acid sequence homology to the breast cancer sequences outlined herein. Such 
20 homology can be based upon the overall nucleic acid or amino acid sequence, and is 

generally determined as outlined below, using either homology programs or hybridization 

conditions. 

For identifying breast cancer-associated sequences, the breast cancer screen 
typically includes comparing genes identified in different tissues, e.g., normal and cancerous 
25 tissues, or tumor tissue samples from patients who have metastatic disease vs. non metastatic 
tissue. Other suitable tissue comparisons include comparing breast cancer samples with 
metastatic cancer samples from other cancers, such as lung, breast, gastrointestinal cancers, 
ovarian, etc. Samples of different stages of breast cancer, e.g., survivor tissue, drug resistant 
states, and tissue undergoing metastasis, are applied to biocbips comprising nucleic acid 
30 probes. The samples are first microdissected, if applicable, and treated as is known in the art 
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for the preparation of mRNA. Suitable biochips are commercially available, e.g. from 
Affymetrix. Gene expression profiles as described herein are generated and the data 
analyzed. 

In one embodiment, the genes showing changes in expression as between 
5 normal and disease states are compared to genes expressed in other normal tissues, preferably 
normal breast, but also including, and not limited to lung, heart, brain, liver, breast, kidney, 
muscle, colon, small intestine, large intestine, spleen, bone and placenta. In a preferred 
Li embodiment, those genes identified during the breast cancer screen that are expressed in any 

significant amount in other tissues are removed from the profile, although in some 
If 10 embodiments, this is not necessary. That is, when screening for drugs, it is usually preferable 
p i that the target be disease specific, to minimize possible side effects. 
N-i In a preferred embodiment, breast cancer sequences are those that are up- 

te>' 

s regulated in breast cancer; that is, the expression of these genes is higher in the breast cancer 
r: tissue as compared to non-cancerous tissue. "Up-regulation" as used herein often means at 
ill 5 least about a two-fold change, preferably at least about a three fold change, with at least about 
p five-fold or higher being preferred. All unigene cluster identification numbers and accession 
1 y numbers herein are for the GenBank sequence database and the sequences of the accession 

numbers are hereby expressly incorporated by reference. GenBank is known in the art, see, 

e.g., Benson, DA, et ah, Nucleic Acids Research 26:1-7 (1998) and 
20 http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g., 

European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 

U.S. Patent Application N. 09/687,576, with the same assignee as the present application, 

further discloses related sequences, compositions, and methods of diagnosis and treatment of 

breast cancer is hereby expressly incorporated by reference. 
25 In another preferred embodiment, breast cancer sequences are those that are 

down-regulated in the breast cancer; that is, the expression of these genes is lower in breast 

cancer tissue as compared to non-cancerous tissue {see, e.g., Tables 1,2, 3, 15, 16 etc. . .). 

"Down-regulation" as used herein often means at least about a two-fold change, preferably at 

least about a three fold change, with at least about five-fold or higher being preferred. 

30 
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Informatics 

The ability to identify genes that are over or under expressed in breast cancer 
can additionally provide high-resolution, high-sensitivity datasets which can be used in the 
areas of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 
biosensor development, and other related areas. For example, the expression profiles can be 
used in diagnostic or prognostic evaluation of patients with breast cancer. Or as another 
example, subcellular toxicological information can be generated to better direct drug structure 
and activity correlation (see Anderson, Pharmaceutical Proteomics: Targets, Mechanism, 
and Function, paper presented at the IBC Proteomics conference, Coronado, CA (June 11-12, 
1998)). Subcellular toxicological information can also be utilized in a biological sensor 
device to predict the likely toxicological effect of chemical exposures and likely tolerable 
exposure thresholds (see U.S. Patent No. 5,81 1,231). Similar advantages accrue from 
datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, saccharides, 
lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of assay data. The data contained in the database is acquired, e.g., 
using array analysis either singly or in a library format. The database can be in substantially 
any form in which data can be maintained and transmitted, but is preferably an electronic 
database. The electronic database of the invention can be maintained on any electronic 
device allowing for the storage of and access to the database, such as a personal computer, 
but is preferably distributed on a wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases can be assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative 
and/or absolute abundance of a variety of molecular and macromolecular species from a 
biological sample undergoing breast cancer, i.e., the identification of breast cancer-associated 
sequences described herein, provide an abundance of information, which can be correlated 
with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, 
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gene-disease causal linkages, identification of correlates of immunity and physiological 
status, among others. Although the data generated from the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 
5 An array of methods for indexing and retrieving biomolecular information is 

known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational 
database system for storing biomolecular sequence information in a manner that allows 
M sequences to be catalogued and searched according to one or more protein function 
^; hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records 
UtO containing information in a format that allows a collection of partial-length DNA sequences 
ft \ to be catalogued and searched according to association with one or more sequencing projects 
5; for obtaining full-length sequences from the collection of partial length sequences. U.S. 
s Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
y : sequence similar to a sequence data item in a gene database based on the degree of similarity 
1 j;15 between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 

sabs: 

O using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-flt measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 

20 described as on-line analytical processing (OLAP), which entails the consolidation of 

projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 

25 the merger of two or more such tree structures. 

See also Mount et al., Bioinformatics (2001); Biological Sequence Analysis: 
Probabilistic Models of Proteins and Nucleic Acids (Durbin et al, eds., 1999); 
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins (Baxevanis & 
Oeullette eds., 1998)); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological 

30 Science and Medicine (1 999); Introduction to Computational Molecular Biology (Setubal et 
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al, eds 1997); Bioinformatics: Methods and Protocols (Misener & Krawetz, eds, 2000); 
Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach (Higgins & 
Taylor, eds., 2000); Brown, Bioinformatics: A Biologist's Guide to Biocomputing and the 
Internet (2001); Han & Kamber, Data Mining: Concepts and Techniques (2000); and 
Waterman, Introduction to Computational Biology: Maps, Sequences, and Genomes (1995). 

The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing 
sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a known pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for breast cancer. In another 
variation, the assay records cross-tabulate one or more of the following parameters for each 
target species in a sample: (1) a unique identification code, which can include, e.g., a target 
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 

present in the sample. 

The invention also provides for the storage and retrieval of a collection of 
target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states or 
transistor gate states, such as an array of cells in a DRAM device (e.g., each cell comprised of 
a transistor and a charge storage area, which may be on the transistor). In one embodiment, 
the invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 
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When the target is a peptide or nucleic acid, the invention preferably provides 
a method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
5 comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 
5 The invention also preferably provides a magnetic disk, such as an IBM- 

Oil 0 compatible (DOS, Windows, Windows95/9 8/2000, Windows NT, OS/2) or other format 
SI (e.g., Linux, SunOS, Solaris, ATX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or 
J! hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
M: analysis, comparison, or relative quantitation method. 
:| 1 5 The invention also provides a network, comprising a plurality of computing 

devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal transmission medium, 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
20 cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
25 database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 
30 initialized to load and execute the computer program for alignment and/or comparison of the 

34 



Attorney Docket No.:018501-005210US 



assay results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to 
5 secondary memory, which is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g. , binding to a selected affinity moiety) and the 
;y same characteristic of the query target and results are output via an I/O device. For example, 
Js 1 a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 

« j 

LflO PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or 

ffi 

ft \ public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
2 Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
J" device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
y : ' etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem, 
rti 5 an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
p other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 

that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 

collection of peptide sequence specificity records obtained by the methods of the invention, 
20 which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 

a program for alignment and comparison, typically with rank-ordering of comparison results 

on the basis of computed similarity values. 

Characteristics of breast cancer-associated proteins 

25 Breast cancer proteins of the present invention may be classified as secreted 

proteins, transmembrane proteins or intracellular proteins. In one embodiment, the breast 
cancer protein is an intracellular protein. Intracellular proteins may be found in the 
cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular 
function and replication (including, e.g., signaling pathways); aberrant expression of such 

30 proteins often results in unregulated or disregulated cellular processes (see, e.g., Molecular 
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Biology of the Cell (Alberts, ed., 3rd ed., 1994). For example, many intracellular proteins 
have enzymatic activity such as protein kinase activity, protein phosphatase activity, protease 
activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins 
also serve as docking proteins that are involved in organizing complexes of proteins, or 
5 targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 
in the proteins of one or more motifs for wMch defined functions have been attributed. In 
S! addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
y lO conserved sequences have been identified in proteins that are involved in protein-protein 
ry interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
.=! targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
« domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 
; targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 

; 1 5 few, have been shown to mediate protein-protein interactions. Some of these may also be 

involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 
20 One useful database is Pfam (protein families), which is a large collection of multiple 

sequence alignments and hidden Markov models covering many common protein domains. 
Versions are available via the internet from Washington University in St. Louis, the Sanger 
Center in England, and the Karolinska Institute in Sweden (see, e.g., Bateman et al, Nuc. 
Acids Res. 28:263-266 (2000); Sonnhammer et al, Proteins 28:405-420 (1997); Bateman et 
25 al, Nuc. Acids Res. 27:260-262 (1 999); and Sonnhammer et al, Nuc. Acids Res. 26:320-322- 
(1998)). 

In another embodiment, the breast cancer sequences are transmembrane 
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. 
They may have an intracellular domain, an extracellular domain, or both. The intracellular 
30 domains of such proteins may have a number of functions including those already described 
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for intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 
domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
5 of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
H ; domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor 
P guanylyl cyclases and receptor serine/threonine protein kinases contain a single 
w 10 transmembrane domain. However, various other proteins including channels and adenylyl 
m cyclases contain numerous transmembrane domains. Many important cell surface receptors 
==i such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
L. domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 
§=j transmembrane domains include approximately 20 consecutive hydrophobic amino acids that 
5 =15 may be followed by charged amino acids. Therefore, upon analysis of the amino acid 
O sequence of a particular protein, the localization and number of transmembrane domains 
within the protein may be predicted (see, e.g. PSORT web site http://psort.nibb.ac.jp/). 
Important transmembrane protein receptors include, but are not limited to the insulin 
receptor, insulin-like growth factor receptor, human growth hormone receptor, glucose 
20 transporters, transferrin receptor, epidermal growth factor receptor, low density lipoprotein 
receptor, epidermal growth factor receptor, leptin receptor, interleukin receptors, e.g. IL-1 
receptor, IL-2 receptor, 

The extracellular domains of transmembrane proteins are diverse; however, 
conserved motifs are found repeatedly among various extracellular domains. Conserved 
25 structure and/or functions have been ascribed to different extracellular motifs. Many 

extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are found on receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 
For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 
30 bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
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cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular domains also 
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions. Cell- 
associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI) 
anchor, or may themselves be transmembrane proteins. Extracellular domains also associate 
5 with the extracellular matrix and contribute to the maintenance of the cell structure. 

Breast cancer proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immunotherapeutics, as are 
H described herein. In addition, as outlined below, transmembrane proteins can be also useful 
S in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
WlO situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
f!j typically permeablized to provide access to intracellular proteins. 

>j K w iH also be appreciated by those in the art that a transmembrane protein can 

be made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 
1 5 secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the breast cancer proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins have a signal 
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 
proteins are involved in numerous physiological events; by virtue of their circulating nature, 
20 they serve to transmit signals to various other cell types. The secreted protein may function in 
an autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting 
on cells in close proximity to the cell that secreted the factor) or an endocrine manner (acting 
on cells at a distance). Thus secreted molecules find use in modulating or altering numerous 
aspects of physiology. Breast cancer proteins that are secreted proteins are particularly 
25 preferred in the present invention as they serve as good targets for diagnostic markers, e.g., 
for blood, plasma, serum, or stool tests. 

Use of breast cancer nucleic acids 

As described above, breast cancer sequence is initially identified by 
30 substantial nucleic acid and/or amino acid sequence homology or linkage to the breast cancer 
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sequences outlined herein. Such homology can be based upon the overall nucleic acid or 
amino acid sequence, and is generally determined as outlined below, using either homology 
programs or hybridization conditions. Typically, linked sequences on a mRNA are found on 
the same molecule. 

The breast cancer nucleic acid sequences of the invention, e.g., the sequences 
in Tables 1-25, can be fragments of larger genes, i.e., they are nucleic acid segments. 
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 
and non-coding regions. Accordingly, as will be appreciated by those in the art, using the 
sequences provided herein, extended sequences, in either direction, of the breast cancer genes 
can be obtained, using techniques well known in the art for cloning either longer sequences or 
the full length sequences; see Ausubel, et al, supra. Much can be done by informatics and 
many sequences can be clustered to include multiple sequences corresponding to a single 
gene, e.g., systems such as UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/). 

Once the breast cancer nucleic acid is identified, it can be cloned and, if 
necessary, its constituent parts recombined to form the entire breast cancer nucleic acid 
coding regions or the entire mRNA sequence. Once isolated from its natural source, e.g., 
contained within a plasmid or other vector or excised therefrom as a linear nucleic acid 
segment, the recombinant breast cancer nucleic acid can be further-used as a probe to identify 
and isolate other breast cancer nucleic acids, e.g., extended coding regions. It can also be 
used as a "precursor" nucleic acid to make modified or variant breast cancer nucleic acids 
and proteins. 

The breast cancer nucleic acids of the present invention are used in several 
ways. In a first embodiment, nucleic acid probes to the breast cancer nucleic acids are made 
and attached to biochips to be used in screening and diagnostic methods, as outlined below, 
or for administration, e.g., for gene therapy, vaccine, and/or antisense applications. 
Alternatively, the breast cancer nucleic acids that include coding regions of breast cancer 
proteins can be put into expression vectors for the expression of breast cancer proteins, again 
for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to breast cancer nucleic acids 
(both the nucleic acid sequences outlined in the figures and/or the complements thereof) are 
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made. The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the breast cancer nucleic acids, i.e. the target sequence (either the target 
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that 
hybridization of the target sequence and the probes of the present invention occurs. As 
outlined below, this complementarity need not be perfect; there may be any number of base 
pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 
sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with 
either overlapping probes or probes to different sections of the target being used. That is, 
two, three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target. The probes can be overlapping (i.e., have some sequence in common), 
or separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
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hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidin to the support and the non-covalent binding of 
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 
equivalents herein is meant that the two moieties, the solid support and the probe, are 
attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the solid support or can be 
formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

In general, the probes are attached to the biochip in a wide variety of ways, as 
will be appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid 
support" or other grammatical equivalents herein is meant a material that can be modified to 
contain discrete individual sites appropriate for the attachment or association of the nucleic 
acid probes and is amenable to at least one detection method. As will be appreciated by those 
in the art, the number of possible substrates are very large, and include, but are not limited to, 
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and 
copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 
polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 
plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluoresce. A preferred substrate is described in copending application entitled Reusable Low 
Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 1999, 
herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art, other configurations of substrates may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 
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sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including 
closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g., 
the biochip is derivatized with a chemical functional group including, but not limited to, 
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, e.g. using linkers as are known in the art; e.g., 
homo-or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company 
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases, 
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, 
and then attached to the surface of the solid support. As will be appreciated by those skilled 
in the art, either the 5' or 3' terminus may be attached to the solid support, or attachment may 
be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which 
bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is 
known in the art. For example, photo activation techniques utilizing photopolymerization 
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those described 
in WO 95/251 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of breast cancer-associated sequences. These assays are typically performed in 
conjunction with reverse transcription. In such assays, a breast cancer-associated nucleic acid 
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sequence acts as a template in an amplification reaction (e.g., Polymerase Chain Reaction, or 
PCR). In a quantitative amplification, the amount of amplification product will be 
proportional to the amount of template in the original sample. Comparison to appropriate 
controls provides a measure of the amount of breast cancer-associated RNA. Methods of 
5 quantitative amplification are well known to those of skill in the art. Detailed protocols for 
quantitative PCR are provided, e.g., in Innis et ah, PCR Protocols, A Guide to Methods and 
Applications (1990). 

y. In some embodiments, a TaqMan based assay is used to measure expression. 

£ TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
LJIjO dye and a 3 ' quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
f{ { extended due to a blocking agent at the 3 ' end. When the PCR product is amplified in 
jy- subsequent cycles, the 5 ' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
s cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 
JT quenching agent, thereby resulting in an increase in fluorescence as a function of 
Hi; 5 amplification (see, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com). 
Q Other suitable amplification methods include, but are not limited to, ligase 

chain reaction (LCR) (see Wu & Wallace, Genomics 4:560 (1989), Landegren et al, Science 
241 : 1077 (1988), and Barringer et al, Gene 89: 1 17 (1990)), transcription amplification 
(Kwoh et al, Proc. Natl Acad. Sci. USA 86:1173 (1989)), self-sustained sequence replication 
20 (Guatelli et al, Proc. Nat. Acad. Sci. USA 87:1874 (1990)), dot PCR, and linker adapter PCR, 
etc. 

Expression of breast cancer proteins from nucleic acids 

In a preferred embodiment, breast cancer nucleic acids, e.g., encoding breast 
25 cancer proteins are used to make a variety of expression vectors to express breast cancer 

proteins which can then be used in screening assays, as described below. Expression vectors 
and recombinant DNA technology are well known to those of skill in the art (see, e.g., 
Ausubel, supra, and Gene Expression Systems (Fernandez & Hoeffler, eds, 1999)) and are 
used to express proteins. The expression vectors may be either self-replicating 
30 extrachromosomal vectors or vectors which integrate into a host genome. Generally, these 
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expression vectors include transcriptional and translational regulatory nucleic acid operably 
linked to the nucleic acid encoding the breast cancer protein. The term "control sequences" 
refers to DNA sequences used for the expression of an operably linked coding sequence in a 
particular host organism. Control sequences that are suitable for prokaryotes, e.g., include a 
promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are 
known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or 
secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein 
that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are contiguous, 
and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 
do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
nucleic acid will generally be appropriate to the host cell used to express the breast cancer 
protein. Numerous types of appropriate expression vectors, and suitable regulatory 
sequences are known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, 
but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and 
stop sequences, translational start and stop sequences, and enhancer or activator sequences. 
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional 
start and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The 
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known in the art, 
and are useful in the present invention. 
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In addition, an expression vector may comprise additional elements. For 
example, the expression vector may have two replication systems, thus allowing it to be 
maintained in two organisms, e.g. in mammalian or insect cells for expression and in a 
procaryotic host for cloning and amplification. Furthermore, for integrating expression 
vectors, the expression vector contains at least one sequence homologous to the host cell 
genome, and preferably two homologous sequences which flank the expression construct. 
The integrating vector may be directed to a specific locus in the host cell by selecting the 
appropriate homologous sequence for inclusion in the vector. Constructs for integrating 
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra). 

In addition, in a preferred embodiment, the expression vector contains a 
selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used. 

The breast cancer proteins of the present invention are produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding a breast 
cancer protein, under the appropriate conditions to induce or cause expression of the breast 
cancer protein. Conditions appropriate for breast cancer protein expression will vary with the 
choice of the expression vector and the host cell, and will be easily ascertained by one skilled 
in the art through routine experimentation or optimization. For example, the use of 
constitutive promoters in the expression vector will require optimizing the growth and 
proliferation of the host cell, while the use of an inducible promoter requires the appropriate 
growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect 
and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, CI 29 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the breast cancer proteins are expressed in 
mammalian cells. Mammalian expression systems are also known in the art, and include 
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retroviral and adenoviral systems. One expression vector system is a retroviral vector system 
such as is generally described in PCT/US97/01019 and PC17US97/01048, both of which are 
hereby expressly incorporated by reference. Of particular use as mammalian promoters are 
the promoters from mammalian viral genes, since the viral genes are often highly expressed 
5 and have a broad host range. Examples include the S V40 early promoter, mouse mammary 
tumor virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, 
and the CMV promoter {see, e.g., Fernandez & Hoeffler, supra). Typically, transcription 

1=5: 

o termination and polyadenylation sequences recognized by mammalian cells are regulatory 

regions located 3' to the translation stop codon and thus, together with the promoter elements, 
CiO flank the coding sequence. Examples of transcription terminator and polyadenlyation signals 
V: include those derived form SV40. 

w The methods of introducing exogenous nucleic acid into mammalian hosts, as 

C well as other hosts, is well known in the art, and will vary with the host cell used. 
: n ; Techniques include dextran-mediated transfection, calcium phosphate precipitation, 
415 polybrene mediated transfection, protoplast fusion, electroporation, viral infection, 
fy encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA 
into nuclei. 

In a preferred embodiment, breast cancer proteins are expressed in bacterial 
systems. Bacterial expression systems are well known in the art. Promoters from 

20 bacteriophage may also be used and are known in the art. In addition, synthetic promoters 
and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of the tip and lac 
promoter sequences. Furthermore, a bacterial promoter can include naturally occurring 
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and 
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome 

25 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the breast cancer protein in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may also include a selectable marker gene to allow for the selection of 

30 bacterial strains that have been transformed. Suitable selection genes include genes which 
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render the bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, 
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes, 
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These 
components are assembled into expression vectors. Expression vectors for bacteria are well 
known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, 
and Streptococcus lividans, among others (e.g., Fernandez & Hoeffler, supra). The bacterial 
expression vectors are transformed into bacterial host cells using techniques well known in 
the art, such as calcium chloride treatment, electroporation, and others. 

In one embodiment, breast cancer proteins are produced in insect cells. 
Expression vectors for the transformation of insect cells, and in particular, baculovirus-based 
expression vectors, are well known in the art. 

In a preferred embodiment, breast cancer protein is produced in yeast cells. 
Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha, 
Kluyveromyces fragilis and K. lactis, Pichia guillerimondii and P. pastoris, 
Schizosaccharomyces pombe, and Yarrowia lipolytica. 

The breast cancer protein may also be made as a fusion protein, using 
techniques well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the 
desired epitope is small, the breast cancer protein may be fused to a carrier protein to form an 
immunogen. Alternatively, the breast cancer protein may be made as a fusion protein to 
increase expression, or for other reasons. For example, when the breast cancer protein is a 
breast cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic 
acid for expression purposes. 

In a preferred embodiment, the breast cancer protein is purified or isolated 
after expression. Breast cancer proteins may be isolated or purified in a variety of ways 
known to those skilled in the art depending on what other components are present in the 
sample. Standard purification methods include electrophoretic, molecular, immunological 
and chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase HPLC chromatography, and chromato focusing. For example, the breast cancer protein 
may be purified using a standard anti-breast cancer protein antibody column. Ultrafiltration 
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and diafiltration techniques, in conjunction with protein concentration, are also useful. For 
general guidance in suitable purification techniques, see Scopes, Protein Purification (1982). 
The degree of purification necessary will vary depending on the use of the breast cancer 
protein. In some instances no purification will be necessary. 

Once expressed and purified if necessary, the breast cancer proteins and 
nucleic acids are useful in a number of applications. They may be used as immuno selection 
reagents, as vaccine reagents, as screening agents, etc. 

Variants of breast cancer proteins 

In one embodiment, the breast cancer proteins are derivative or variant breast 
cancer proteins as compared to the wild-type sequence. That is, as outlined more fully below, 
the derivative breast cancer peptide will often contain at least one amino acid substitution, 
deletion or insertion, with amino acid substitutions being particularly preferred. The amino 
acid substitution, insertion or deletion may occur at any residue within the breast cancer 
peptide. 

Also included within one embodiment of breast cancer proteins of the present 
invention are amino acid sequence variants. These variants typically fall into one or more of 
three classes: substitutional, insertional or deletional variants. These variants ordinarily are 
prepared by site specific mutagenesis of nucleotides in the DNA encoding the breast cancer 
protein, using cassette or PCR mutagenesis or other techniques well known in the art, to 
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
culture as outlined above. However, variant breast cancer protein fragments having up to 
about 100-150 residues may be prepared by in vitro synthesis using established techniques. 
Amino acid sequence variants are characterized by the predetermined nature of the variation, 
a feature that sets them apart from naturally occurring allelic or interspecies variation of the 
breast cancer protein amino acid sequence. The variants typically exhibit the same 
qualitative biological activity as the naturally occurring analogue, although variants can also 
be selected which have modified characteristics as will be more fully outlined below. 

While the site or region for introducing an amino acid sequence variation is 
predetermined, the mutation per se need not be predetermined. For example, in order to 
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optimize the performance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed breast cancer variants screened for 
the optimal combination of desired activity. Techniques for making substitution mutations at 
predetermined sites in DNA having a known sequence are well known, e.g., Ml 3 primer 
5 mutagenesis and PCR mutagenesis. Screening of the mutants is done using assays of breast 
cancer protein activities. 

Amino acid substitutions are typically of single residues; insertions usually 
will be on the order of from about 1 to 20 amino acids, although considerably larger 

p insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in 

JflO some cases deletions may be much larger. 

fU Substitutions, deletions, insertions or any combination thereof may be used to 

ri arrive at a final derivative. Generally these changes are done on a few amino acids to 

L minimize the alteration of the molecule. However, larger changes may be tolerated in certain 

Q 

¥ : circumstances. When small alterations in the characteristics of the breast cancer protein are 
jf 5 desired, substitutions are generally made in accordance with the amino acid substitution 
relationships provided in the definition section. 

The variants typically exhibit the same qualitative biological activity and will 
elicit the same immune response as the naturally-occurring analog, although variants also are 
selected to modify the characteristics of the breast cancer proteins as needed. Alternatively, 
20 the variant may be designed such that the biological activity of the breast cancer protein is 
altered. For example, glycosylation sites may be altered or removed. 

Substantial changes in function or immunological identity are made by 
selecting substitutions that are less conservative than those described above. For example, 
substitutions may be made which more significantly affect: the structure of the polypeptide 
25 backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure; 
the charge or hydrophobicity of the molecule at the target site; or the bulk of the side chain. 
The substitutions which in general are expected to produce the greatest changes in the 
polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl or threonyl is 
substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or 
30 alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue 
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having an electropositive side chain, e.g. lysyl, arginyl, or histidyl, is substituted for (or by) 
an electronegative residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side 
chain, e.g. phenylalanine, is substituted for (or by) one not having a side chain, e.g. glycine. 

Covalent modifications of breast cancer polypeptides are included within the 
scope of this invention. One type of covalent modification includes reacting targeted amino 
acid residues of a breast cancer polypeptide with an organic derivatizing agent that is capable 
of reacting with selected side chains or the N-or C-terminal residues of a breast cancer 
polypeptide. Derivatization with bifunctional agents is useful, for instance, for crosslinking 
breast cancer polypeptides to a water-insoluble support matrix or surface for use in the 
method for purifying anti-breast cancer polypeptide antibodies or screening assays, as is more 
fully described below. Commonly used crosslinking agents include, e.g., 1,1- 
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters 
with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters 
such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N- 
maleimido-l,8-octane and agents such as methyl-3-((p-azidophenyl)dithio)propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the amino groups of the lysine, arginine, and histidine side chains (Creighton, 
Proteins: Structure and Molecular Properties, pp. 79-86 (1983)), acetylation of the N- 
terminal amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the breast cancer polypeptide 
included within the scope of this invention comprises altering the native glycosylation pattern 
of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes 
herein to mean deleting one or more carbohydrate moieties found in native sequence breast 
cancer polypeptide, and/or adding one or more glycosylation sites that are not present in the 
native sequence breast cancer polypeptide. Glycosylation patterns can be altered in many 
ways. For example the use of different cell types to express breast cancer-associated 
sequences can result in different glycosylation patterns. 
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Addition of glycosylation sites to breast cancer polypeptides may also be 
accomplished by altering the amino acid sequence thereof. The alteration may be made, e.g., 
by the addition of, or substitution by, one or more serine or threonine residues to the native 
sequence breast cancer polypeptide (for O-linked glycosylation sites). The breast cancer 
amino acid sequence may optionally be altered through changes at the DNA level, 
particularly by mutating the DNA encoding the breast cancer polypeptide at preselected bases 
such that codons are generated that will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the 
breast cancer polypeptide is by chemical or enzymatic coupling of glycosides to the 
polypeptide. Such methods are described in the art, e.g., in WO 87/05330, and in Aplin & 
Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the breast cancer polypeptide 
may be accomplished chemically or enzymatically or by mutational substitution of codons 
encoding for amino acid residues that serve as targets for glycosylation. Chemical 
deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et al, Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al.,Anal. Biochem., 1 18:131 
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the 
use of a variety of endo-and exo-glycosidases as described by Thotakura et al. , Meth. 
Enzymol, 138:350 (1987). 

Another type of covalent modification of breast cancer comprises linking the 
breast cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., 
polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in 
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Breast cancer polypeptides of the present invention may also be modified in a 
way to form chimeric molecules comprising a breast cancer polypeptide fused to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 
molecule comprises a fusion of a breast cancer polypeptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino-or carboxyl-terminus of the breast cancer polypeptide. The 
presence of such epitope-tagged forms of a breast cancer polypeptide can be detected using 
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an antibody against the tag polypeptide. Also, provision of the epitope tag enables the breast 
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a breast cancer polypeptide with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the 
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
HIS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et 
al, Mol. Cell. Biol. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto (Evan et al, Molecular and Cellular Biology 5:3610-3616 (1985)); 
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody {Paborsky et al, 
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide 
(Hopp et al, BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al, 
Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al, J. Biol Chem. 
266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al, 
Proc. Natl Acad. Sci. USA 87:6393-6397 (1990)). 

Also included are other breast cancer proteins of the breast cancer family, and 
breast cancer proteins from other organisms, which are cloned and expressed as outlined 
below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be 
used to find other related breast cancer proteins from humans or other organisms. As will be 
appreciated by those in the art, particularly useful probe and/or PCR primer sequences 
include the unique areas of the breast cancer nucleic acid sequence. As is generally known in 
the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from 
about 20 to about 30 being preferred, and may contain inosine as needed. The conditions for 
the PCR reaction are well known in the art (e.g., Innis, PCR Protocols, supra). 

Antibodies to breast cancer proteins 

In a preferred embodiment, when the breast cancer protein is to be used to 
generate antibodies, e.g., for immunotherapy or immunodiagnosis, the breast cancer protein 
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should share at least one epitope or determinant with the full length protein. By "epitope" or 
"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 
made to a smaller breast cancer protein will be able to bind to the full-length protein, 
5 particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. 

Methods of preparing polyclonal antibodies are known to the skilled artisan 
(e.g., Coligan, supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a 

jus:: 

p mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant. 
flO Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple 

subcutaneous or intraperitoneal injections. The immunizing agent may include a protein 
SJ encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It 

may be useful to conjugate the immunizing agent to a protein known to be immunogenic in 
Q the mammal being immunized. Examples of such immunogenic proteins include but are not 
fi;5 limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean 
%[ trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete 
fU adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 

dicoiynomycolate). The immunization protocol may be selected by one skilled in the art 
without undue experimentation. 
20 The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 

antibodies may be prepared using hybridoma methods, such as those described by Kohler & 
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 
25 the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The 

immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1- 
25 or fragment thereof, or a fusion protein thereof. Generally, either peripheral blood 
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
30 fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 
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glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice, 
pp. 59-103 (1986)). Immortalized cell lines are usually transformed mammalian cells, 
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse 
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture 
5 medium that preferably contains one or more substances that inhibit the growth or survival of 
the unfused, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
5 medium"), which substances prevent the growth of HGPRT-deficient cells. 
Of 0 In one embodiment, the antibodies are bispecific antibodies. Bispecific 

V'\ antibodies are monoclonal, preferably human or humanized, antibodies that have binding 

specificities for at least two different antigens or that have binding specificities for two 
q epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
J7: protein encoded by a nucleic acid Tables 1-25 or a fragment thereof, the other one is for any 
=f ; 5 other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
S j preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 

In a preferred embodiment, the antibodies to breast cancer protein are capable 
of reducing or eliminating a biological function of a breast cancer protein, as is described 
20 below. That is, the addition of anti-breast cancer protein antibodies (either polyclonal or 

preferably monoclonal) to breast cancer tissue (or cells containing breast cancer) may reduce 
or eliminate the breast cancer. Generally, at least a 25% decrease in activity, growth, size or 
the like is preferred, with at least about 50% being particularly preferred and about a 95- 
100% decrease being especially preferred. 
25 In a preferred embodiment the antibodies to the breast cancer proteins are 

humanized antibodies (e.g., Xenerex Biosciences, Mederex, Inc., Abgenix, Inc., Protein 
Design Labsjnc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain 
30 minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
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human immunoglobulins (recipient antibody) in which residues from a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 
affinity and capacity. In some instances, Fv framework residues of the human 
5 immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, a humanized antibody will comprise 
y, substantially all of at least one, and typically two, variable domains, in which all or 
5; substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
LiO and all or substantially all of the framework (FR) regions are those of a human 
m immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 

at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
a immunoglobulin (Jones et al., Nature 321 :522-525 (1986); Riechmann et al, Nature 
h' 332:323-329 (1988); and Presta, Curr. Op. Struct. Biol. 2:593-596 (1992)). Humanization 
Hl;5 can be essentially performed following the method of Winter and co-workers (Jones et al., 
O Nature 321:522-525 (1986); Riechmann et al., Nature 332:323-327 (1988); Verhoeyen et al, 
Science 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 
20 human variable domain has been substituted by the corresponding sequence from a non- 
human species. 

Human antibodies can also be produced using various techniques known in the 
art, including phage display libraries (Hoogenboom & Winter, J. Mol. Biol. 227:381 (1991); 
Marks et al, J. Mol. Biol. 222:581 (1991)). The techniques of Cole et al. and Boerner et al. 

25 are also available for the preparation of human monoclonal antibodies (Cole et al., 

Monoclonal Antibodies and Cancer Therapy, p. 77 (1985) and Boerner et al, J. Immunol. 
147(l):86-95 (1991)). Similarly, human antibodies can be made by introducing of human 
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 
immunoglobulin genes have been partially or completely inactivated. Upon challenge, 

30 human antibody production is observed, which closely resembles that seen in humans in all 
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respects, including gene rearrangement, assembly, and antibody repertoire. This approach is 
described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 
5,661,016, and in the following scientific publications: Marks et al, Bio/Technology 10:779- 
783 (1992); Lonberg et al, Nature 368:856-859 (1994); Morrison, Nature 368:812-13 
(1994); Fishwild et al, Nature Biotechnology 14:845-51 (1996); Neuberger, Nature 
Biotechnology 14:826 (1996); Lonberg & Huszar, Intern. Rev. Immunol. 13:65-93 (1995). 

By immunotherapy is meant treatment of breast cancer with an antibody raised 
against breast cancer proteins. As used herein, immunotherapy can be passive or active. 
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 
(patient). Active immunization is the induction of antibody and/or T-cell responses in a 
recipient (patient). Induction of an immune response is the result of providing the recipient 
with an antigen to which antibodies are raised. As appreciated by one of ordinary skill in the 
art, the antigen may be provided by injecting a polypeptide against which antibodies are 
desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of 
expressing the antigen and under conditions for expression of the antigen, leading to an 
immune response. 

In a preferred embodiment the breast cancer proteins against which antibodies 
are raised are secreted proteins as described above. Without being bound by theory, 
antibodies used for treatment, bind and prevent the secreted protein from binding to its 
receptor, thereby inactivating the secreted breast cancer protein. 

In another preferred embodiment, the breast cancer protein to which 
antibodies are raised is a transmembrane protein. Without being bound by theory, antibodies 
used for treatment, bind the extracellular domain of the breast cancer protein and prevent it 
from binding to other proteins, such as circulating ligands or cell-associated molecules. The 
antibody may cause down-regulation of the transmembrane breast cancer protein. As will be 
appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
breast cancer protein. The antibody is also an antagonist of the breast cancer protein. 
Further, the antibody prevents activation of the transmembrane breast cancer protein. In one 
aspect, when the antibody prevents the binding of other molecules to the breast cancer 
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protein, the antibody prevents growth of the cell. The antibody may also be used to target or 
sensitize the cell to cytotoxic agents, including, but not limited to TNF-a, TNF-|3, IL-1, INF-y 
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, 
methotrexate, and the like. In some instances the antibody belongs to a sub-type that 
5 activates serum complement when complexed with the transmembrane protein thereby 
mediating cytotoxicity or antigen-dependent cytotoxicity (ADCC). Thus, breast cancer is 
treated by administering to a patient antibodies directed against the transmembrane breast 

Q cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 

n=j otherwise provide means to locally ablate cells. 

OfO In another preferred embodiment, the antibody is conjugated to an effector 

Si moiety. The effector moiety can be any number of molecules, including labelling moieties 
^ such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect 
O the therapeutic moiety is a small molecule that modulates the activity of the breast cancer 
hj protein. In another aspect the therapeutic moiety modulates the activity of molecules 
3( 5 associated with or in close proximity to the breast cancer protein. The therapeutic moiety 
fU may inhibit enzymatic activity such as protease or collagenase or protein kinase activity 
associated with breast cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent. In this method, targeting the cytotoxic agent to breast cancer tissue or cells, results in 
20 a reduction in the number of afflicted cells, thereby reducing symptoms associated with 

breast cancer. Cytotoxic agents are numerous and varied and include, but are not limited to, 
cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their 
corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include 
25 radiochemicals made by conjugating radioisotopes to antibodies raised against breast cancer 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 
the antibody. Targeting the therapeutic moiety to transmembrane breast cancer proteins not 
only serves to increase the local concentration of therapeutic moiety in the breast cancer 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 
30 the therapeutic moiety. 
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In another preferred embodiment, the breast cancer protein against which the 
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
5 the individual or cell. Moreover, wherein the breast cancer protein can be targeted within a 
cell, i.e., the nucleus, an antibody thereto contains a signal for that target localization, i.e., a 
nuclear localization signal. 

The breast cancer antibodies of the invention specifically bind to breast cancer 
Q proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
flO Kd of at least about 0.1 mM, more usually at least about 1 uM, preferably at least about 0.1 
ft 1 uM or better, and most preferably, 0.01 uM or better. Selectivity of binding is also 
SI important. 

i : 
5 

Detection of breast cancer sequence for diagnostic and therapeutic applications 

ft!: 5 In one aspect, the RNA expression levels of genes are determined for different 

% cellular states in the breast cancer phenotype. Expression levels of genes in normal tissue 
fy (i.e., not undergoing breast cancer) and in breast cancer tissue (and in some cases, for varying 
severities of breast cancer that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 
20 essentially a "fingerprint" of the state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 
generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 
25 obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue 
sample has the gene expression profile of normal or cancerous tissue. This will provide for 
molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
30 patterns within and among cells and tissue. Thus, a differentially expressed gene can 
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qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
normal versus breast cancer tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
5 detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
y ; an increased amount of transcript, or downregulated, resulting in a decreased amount of 

transcript. The degree to which expression differs need only be large enough to quantify via 
IflO standard characterization techniques as outlined below, such as by use of Affymetrix 
p \ GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), 

hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
s quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined 
y' above, preferably the change in expression (i.e., upregulation or downregulation) is at least 
= |5 about 50%, more preferably at least about 100%, more preferably at least about 150%, more 
P preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 
20 final gene product itself (protein) can be monitored, e.g., with antibodies to the breast cancer 
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to breast 
cancer genes, i.e., those identified as being important in a breast cancer phenotype, can be 
evaluated in a breast cancer diagnostic test. 
25 In a preferred embodiment, gene expression monitoring is performed 

simultaneously on a number of genes. Multiple protein expression monitoring can be 
performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the breast cancer nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of breast cancer sequences in 
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a particular cell. The assays are further described below in the example. PCR techniques can 

be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the breast cancer protein are 

detected. Although DNA or RNA encoding the breast cancer protein may be detected, of 
5 particular interest are methods wherein an mRNA encoding a breast cancer protein is 

detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 

complementary to and hybridizes with the mRNA and includes, but is not limited to, 
y ; oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
m herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
LrtO examined on a solid support such as nylon membranes and hybridizing the probe with the 
Jfi sample. Following washing to remove the non-specifically bound probe, the label is 

detected. In another method detection of the mRNA is performed in situ. In this method 
= permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 

probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 
Pi 5 washing to remove the non-specifically bound probe, the label is detected. For example a 
p digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a 
' breast cancer protein is detected by binding the digoxygenin with an anti-digoxygenin 

secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 

indoyl phosphate. 

20 In a preferred embodiment, various proteins from the three classes of proteins 

as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The breast cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing breast cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 

25 expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, breast cancer proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of breast cancer. 
30 Detection of these proteins in putative breast cancer tissue allows for detection or diagnosis 
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of breast cancer. In one embodiment, antibodies are used to detect breast cancer proteins. A 
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the breast cancer protein is 
5 detected, e.g., by immunoblotting with antibodies raised against the breast cancer protein. 
Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the breast cancer protein find use in 
M in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in 
p Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 
HO to many antibodies to the breast cancer protein(s). Following washing to remove non- 
iU specific antibody binding, the presence of the antibody or antibodies is detected. In one 
o embodiment the antibody is detected by incubating with a secondary antibody that contains a 
° = detectable label. In another method the primary antibody to the breast cancer protein(s) 
H contains a detectable label, e.g. an enzyme marker that can act on a substrate. In another 
j|5 preferred embodiment each one of multiple primary antibodies contains a distinct and 
ir! detectable label. This method finds particular use in simultaneous screening for a plurality of 
breast cancer proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 
20 ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing breast 
cancer from blood, serum, plasma, stool, and other samples. Such samples, therefore, are 
useful as samples to be probed or tested for the presence of breast cancer proteins. 
25 Antibodies can be used to detect a breast cancer protein by previously described 
immunoassay techniques including ELIS A, immunoblotting (western blotting), 
immunoprecipitation, BIACORE technology and the like. Conversely, the presence of 
antibodies may indicate an immune response against an endogenous breast cancer protein. 

In a preferred embodiment, in situ hybridization of labeled breast cancer 
30 nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 
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breast cancer tissue and/or normal tissue, are made. In situ hybridization {see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 
findings. It is further understood that the genes which indicate the diagnosis may differ from 
5 those which indicate the prognosis and molecular profiling of the condition of the cells may 
lead to distinctions between responsive or refractory conditions or may be predictive of 
outcomes. 

y. In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 

acids, modified proteins and cells containing breast cancer sequences are used in prognosis 
LflO assays. As above, gene expression profiles can be generated that correlate to breast cancer, 
P*i in terms of long term prognosis. Again, this may be done on either a protein or gene level, 
Ni with the use of genes being preferred. As above, breast cancer probes may be attached to 

biochips for the detection and quantification of breast cancer sequences in a tissue or patient. 
The assays proceed as outlined above for diagnosis. PCR method may provide more 
PS 5 sensitive and accurate quantification. 

O 

1 "= ! Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and 
antibodies as described herein are used in drug screening assays. The breast cancer proteins, 
20 antibodies, nucleic acids, modified proteins and cells containing breast cancer sequences are 
used in drug screening assays or by evaluating the effect of drug candidates on a "gene 
expression profile" or expression profile of polypeptides. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes after treatment with a candidate 
25 agent (e.g., Zlokarnik, et al, Science 279:84-8 (1998); Heid, Genome Res 6:986-94, 1996). 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified breast cancer proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the breast cancer phenotype or an identified physiological 
30 function of a breast cancer protein. As above, this can be done on an individual gene level or 
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by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 

Having identified the differentially expressed genes herein, a variety of assays 
may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as up regulated in breast cancer, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
breast cancer protein. "Modulation" thus includes both an increase and a decrease in gene 
expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing breast cancer, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in breast cancer tissue compared to normal 
tissue, a decrease of about four- fold is often desired; similarly, a 10- fold decrease in breast 
cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in 
expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the breast cancer protein and standard 
immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein.. 

In this embodiment, the breast cancer nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of breast cancer sequences in 
a particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plate, 
may be used with dispensed primers in desired wells. A PCR reaction can then be performed 
and analyzed for each well. 
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Expression monitoring can be performed to identify compounds that modify 
the expression of one or more breast cancer-associated sequences, e.g., a polynucleotide 
sequence set out inTable 17. Generally, in a preferred embodiment, a test modulator is added 
to the cells prior to analysis. Moreover, screens are also provided to identify agents that 
5 modulate breast cancer, modulate breast cancer proteins, bind to a breast cancer protein, or 
interfere with the binding of a breast cancer protein and an antibody or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
Q molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
7%0 indirectly alter the breast cancer phenotype or the expression of a breast cancer sequence, 
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 

f 1 ! 

%\ expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
* =: embodiment, the modulator suppresses a breast cancer phenotype, e.g. to a normal tissue 
O fingerprint. In another embodiment, a modulator induced a breast cancer phenotype. 

t.i. 

hi 5 Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
jr to obtain a differential response to the various concentrations. Typically, one of these 
rU concentrations serves as a negative control, i.e., at zero concentration or below the level of 
detection. 

Drug candidates encompass numerous chemical classes, though typically they 
20 are organic molecules, preferably small organic compounds having a molecular weight of 
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than 
2000, or less than 1 500 or less than 1000 or less than 500 D. Candidate agents comprise 
functional groups necessary for structural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, 
25 preferably at least two of the functional chemical groups. The candidate agents often 
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic 
structures substituted with one or more of the above functional groups. Candidate agents are 
also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines, 
pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred 
30 are peptides. 
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In one aspect, a modulator will neutralize the effect of a breast cancer protein. 
By "neutralize" is meant that activity of a protein is inhibited or blocked and the consequent 
effect on the cell. 

In certain embodiments, combinatorial libraries of potential modulators will be 
5 screened for an ability to bind to a breast cancer polypeptide or to modulate activity. 

Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
y ; e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
j£ and activity of those variant compounds. Often, high throughput screening (HTS) methods 
WO are employed for such an analysis. 

S'n In one preferred embodiment, high throughput screening methods involve 

providing a library containing a large number of potential therapeutic compounds (candidate 
= compounds). Such "combinatorial chemical libraries" are then screened in one or more 
p' assays to identify those library members (particular chemical species or subclasses) that 
^15 display a desired characteristic activity. The compounds thus identified can serve as 

sskt: 

p conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 
1 v A combinatorial chemical library is a collection of diverse chemical 

compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 

20 chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop etal., J. Med. Chem. 37(9):1233-1251 (1994)). 

25 Preparation and screening of combinatorial chemical libraries is well known to 

those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka, Pept. Prot. Res. 37:487-493 
(1991), Houghton et al, Nature, 354:84-88 (1991)), peptoids (PCT Publication No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 

30 Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as 
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hydantoins, benzodiazepines and dipeptides (Hobbs et al, Proc. Nat. Acad. Sci. USA 
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al., J. Amer. Chem. Soc. 
1 14:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding 
(Hirschmann et al, J. Amer. Chem. Soc. 1 14:9217-9218 (1992)), analogous organic syntheses 
of small compound libraries (Chen et al., J. Amer. Chem. Soc. 116:2661 (1994)), 
oligocarbamates (Cho, et al, Science 261:1303 (1993)), and/or peptidyl phosphonates 
(Campbell et al, J. Org. Chem. 59:658 (1994)). See, generally, Gordon et al, J. Med. Chem. 
37:1385 (1994), nucleic acid libraries (see, e.g., Strategene, Corp.), peptide nucleic acid 
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn et al, Nature 
Biotechnology 14(3):309-314 (1996), and PCT7US96/10287), carbohydrate libraries (see, 
e.g., Liang et al, Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small 
organic molecule libraries (see, e.g., benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993); 
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. 
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, 
U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

A number of well known robotic systems have also been developed for 
solution phase chemistries. These systems include automated workstations like the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif), which mimic the manual 
synthetic operations performed by a chemist. Any of the above devices are suitable for use 
with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, 
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MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of breast cancer gene transcription, 
5 inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other 
properties of particular nucleic acids or protein products are well known to those of skill in 
P the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 
= r!0 e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins, 

U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 
SI binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
~ throughput methods of screening for ligand/antibody binding. 

P In addition, high throughput screening systems are commercially available 

f§5 (see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
% Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
FU typically automate entire procedures, including all sample and reagent pipetting, liquid 

dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
20 as a high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 
25 proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
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preferred. Particularly useful test compound will be directed to the class of proteins to which 
the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 
may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence 
preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected from a 
limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino acids, 
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of 
nucleic acid binding domains, the creation of cysteines, for cross-linking, prolines for SH-3 
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to 
purines, etc. 

Modulators of breast cancer can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 
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After the candidate agent has been added and the cells allowed to incubate for 
some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
5 with purification and/or amplification such as PCR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a 
P fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of 
i fiO detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 
jj; such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 
Si appropriate substrate produces a product that can be detected. Alternatively, the label can be 

s ; 

a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 
ff|5 epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 
JE; streptavidin is labeled as described above, thereby, providing a detectable signal for the 
iV bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 
20 probes, as is generally outlined in U.S. Patent Nos. 5,68 1,702, 5,597,909, 5,545,730, 

5.594.117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 

5.594.118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 

25 conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allows formation of the label probe 
hybridization complex only in the presence of target. Stringency can be controlled by 
30 altering a step parameter that is a thermodynamic variable, including, but not limited to, 
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temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
5 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
!T of other reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc. 
00 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
pn specific or background interactions. Reagents that otherwise improve the efficiency of the 

assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
D used as appropriate, depending on the sample preparation methods and purity of the target. 
Pi The assay data are analyzed to determine the expression levels, and changes in 

jl|5 expression levels as between states, of individual genes, forming a gene expression profile. 
J:; Screens are performed to identify modulators of the breast cancer phenotype. 

STj In one embodiment, screening is performed to identify modulators that can induce or 

suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, e.g., for diagnostic applications, having identified differentially 
20 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression of individual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, having identified the importance of a gene in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
25 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent. After identifying a modulator based upon its ability to suppress a breast 
cancer expression pattern leading to a normal expression pattern, or to modulate a single 
breast cancer gene expression profile so as to mimic the expression of the gene from normal 
30 tissue, a screen as described above can be performed to identify genes that are specifically 
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modulated in response to the agent. Comparing expression profiles between normal tissue 
and agent treated breast cancer tissue reveals genes that are not expressed in normal tissue or 
breast cancer tissue, but are expressed in agent treated tissue. These agent-specific sequences 
can be identified and used by methods described herein for breast cancer genes or proteins. 
In particular these sequences and the proteins they encode find use in marking or identifying 
agent treated cells. In addition, antibodies can be raised against the agent induced proteins 
and used to target novel therapeutics to the treated breast cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of 
breast cancer cells, that have an associated breast cancer expression profile. By 
"administration" or "contacting" herein is meant that the candidate agent is added to the cells 
in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 
encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression of 
the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems 
can also be used. 

Once the test compound has been administered to the cells, the cells can be 
washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
generated, as outlined herein. 

Thus, e.g., breast cancer tissue may be screened for agents that modulate, e.g., 
induce or suppress the breast cancer phenotype. A change in at least one gene, preferably 
many, of the expression profile indicates that the agent has an effect on breast cancer activity. 
By defining such a signature for the breast cancer phenotype, screens for new drugs that alter 
the phenotype can be devised. With this approach, the drug target need not be known and 
need not be represented in the original expression screening platform, nor does the level of 
transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
differentially expressed gene as important in a particular state, screening of modulators of 
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either the expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "breast cancer proteins" or 
a "breast cancer modulatory protein". The breast cancer modulatory protein may be a 
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic 
5 acids of the Tables. Preferably, the breast cancer modulatory protein is a fragment. In a 
preferred embodiment, the breast cancer amino acid sequence which is used to determine 
sequence identity or similarity is encoded by a nucleic acid of Table 25. In another 
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a 

p: nucleic acid of Table 25. In another embodiment, the sequences are sequence variants as 

HO further described herein. 

p!i Preferably, the breast cancer modulatory protein is a fragment of 

approximately 14 to 24 amino acids long. More preferably the fragment is a soluble 
fragment. Preferably, the fragment includes a non-transmembrane region. In a preferred 

Q embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, the 
5 C-terminus of the fragment is kept as a free acid and the N-terminus is a free amine to aid in 

=f= ; coupling, i.e., to cysteine. 

rjM In one embodiment the breast cancer proteins are conjugated to an 

immunogenic agent as discussed herein. In one embodiment the breast cancer protein is 
conjugated to BSA. 

20 Measurements of breast cancer polypeptide activity, or of breast cancer or the 

breast cancer phenotype can be performed using a variety of assays. For example, the effects 
of the test compounds upon the function of the breast cancer polypeptides can be measured 
by examining parameters described above. A suitable physiological change that affects 
activity can be used to assess the influence of a test compound on the polypeptides of this 

25 invention. When the functional consequences are determined using intact cells or animals, 
one can also measure a variety of effects such as, in the case of breast cancer associated with 
tumors, tumor growth, tumor metastasis, neovascularization, hormone release, transcriptional 
changes to both known and uncharacterized genetic markers (e.g., northern blots), changes in 
cell metabolism such as cell growth or pH changes, and changes in intracellular second 
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messengers such as cGMP. In the assays of the invention, mammalian breast cancer 
polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in 
vitro. For example, a breast cancer polypeptide is first contacted with a potential modulator 
5 and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the breast cancer polypeptide levels are determined in vitro by measuring the level of protein 
or mRNA. The level of protein is measured using immunoassays such as western blotting, 
ELISA and the like with an antibody that selectively binds to the breast cancer polypeptide or 
j^' a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 
O0 hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are 
%l preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
fy detection agents, e.g., fluorescently or radio actively labeled nucleic acids, radioactively or 
p enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the breast cancer 
H5 protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
jg: protein, CAT, or p-gal. The reporter construct is typically transfected into a cell. After 
Jr; treatment with a potential modulator, the amount of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on 
20 individual genes and gene products (proteins). That is, having identified a particular 

differentially expressed gene as important in a particular state, screening of modulators of the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "breast cancer proteins." 
The breast cancer protein may be a fragment, or alternatively, be the full length protein to a 
25 fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
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differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 
isolated gene product is used; that is, the gene products of one or more differentially 
5 expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the breast cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a breast 
(Z! cancer protein and a candidate compound, and determining the binding of the compound to 
tAO the breast cancer protein. Preferred embodiments utilize the human breast cancer protein, 
W=| although other mammalian proteins may also be used, e.g. for the development of animal 

3 i : 
• W- 

\\ models of human disease. In some embodiments, as outlined herein, variant or derivative 

breast cancer proteins may be used. 
Q Generally, in a preferred embodiment of the methods herein, the breast cancer 

rll 5 protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
S; sample receiving areas (e.g. a microtiter plate, an array, etc.). The insoluble supports may be 
!!J made of any composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 
20 suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 
25 with the reagents and overall methods of the invention, maintains the activity of the 

composition and is nondiffusable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 
30 the protein or agent, excess unbound material is removed by washing. The sample receiving 
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areas may then be blocked through incubation with bovine serum albumin (BSA), casein or 
other innocuous protein or other moiety. 

In a preferred embodiment, the breast cancer protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 
support and the breast cancer protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 
protein binding, functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the 
breast cancer protein may be done in a number of ways. In a preferred embodiment, the 
compound is labeled, and binding determined directly, e.g., by attaching all or a portion of 
the breast cancer protein to a solid support, adding a labeled candidate agent (e.g., a 
fluorescent label), washing off excess reagent, and determining whether the label is present 
on the solid support. Various blocking and washing steps may be utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the 
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.g., 125 I for the proteins and a fluorophor 
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, the binding of the test compound is determined by 
competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e., a breast cancer protein), such as an antibody, peptide, binding partner, ligand, 
etc. Under certain circumstances, there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compound. In one 
embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present. Incubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 
40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput 
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screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally 
removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
5 compound. Displacement of the competitor is an indication that the test compound is binding 
to the breast cancer protein and thus is capable of binding to, and potentially modulating, the 
activity of the breast cancer protein. In this embodiment, either component can be labeled. 
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the 
Q0 label on the support indicates displacement. 

ff\ In an alternative embodiment, the test compound is added first, with 

J *; incubation and washing, followed by the competitor. The absence of binding by the 

p! competitor may indicate that the test compound is bound to the breast cancer protein with a 

p higher affinity. Thus, if the test compound is labeled, the presence of the label on the 

Q 5 support, coupled with a lack of competitor binding, may indicate that the test compound is 

S capable of binding to the breast cancer protein. 

Jf[ In a preferred embodiment, the methods comprise differential screening to 

identity agents that are capable of modulating the activity of the breast cancer proteins. In 
this embodiment, the methods comprise combining a breast cancer protein and a competitor 

20 in a first sample. A second sample comprises a test compound, a breast cancer protein, and a 
competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the breast cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 

25 agent is capable of binding to the breast cancer protein. 

Alternatively, differential screening is used to identify drug candidates that 
bind to the native breast cancer protein, but cannot bind to modified breast cancer proteins. 
The structure of the breast cancer protein may be modeled, and used in rational drug design 
to synthesize agents that interact with that site. Drug candidates that affect the activity of a 
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breast cancer protein are also identified by screening drugs for the ability to either enhance or 
reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non- specifically bound material 
and the amount of bound, generally labeled agent determined. For example, where a 
radiolabel is employed, the samples may be counted in a scintillation counter to determine the 
amount of bound compound. 

A variety of other reagents may be included in the screening assays. These 
include reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which may be used 
to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a breast cancer protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising breast cancer 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
nucleic acid that encodes a breast cancer protein. In a preferred embodiment, a library of 
candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, e.g. hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate breast cancer agents are identified. 
Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the breast cancer protein. Once identified, similar structures are evaluated to identify critical 
structural feature of the compound. 
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In one embodiment, a method of inhibiting breast cancer cell division is 
provided. The method comprises administration of a breast cancer inhibitor. In another 
embodiment, a method of inhibiting breast cancer is provided. The method comprises 
administration of a breast cancer inhibitor. In a further embodiment, methods of treating cells 
or individuals with breast cancer are provided. The method comprises administration of a 
breast cancer inhibitor. 

In one embodiment, a breast cancer inhibitor is an antibody as discussed 
above. In another embodiment, the breast cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to 
those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft agar. The transformed cells, when transfected with tumor 
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of breast cancer sequences, which when expressed in host cells, inhibit abnormal 
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate 
the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
described in Freshney, Culture of Animal Cells a Manual of Basic Technique (3 Td ed., 1994), 
herein incorporated by reference. See also, the methods section of Garkavtsev et al. (1996), 
supra, herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until 
they touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 
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higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with ( 3 H)-thymidine at saturation density is a 
preferred method of measuring density limitation of growth. Transformed host cells are 
P transfected with a breast cancer-associated sequence and are grown for 24 hours at saturation 
fsliO density in non-limiting medium conditions. The percentage of cells labeling with ( H)- 
thymidine is determined autoradiographically. See, Freshney (1994), supra. 

R.I 

jS=, 

J" Growth factor or serum dependence 

p Transformed cells have a lower serum dependence than their normal 

ffl5 counterparts {see, e.g., Temin, J. Natl. Cancer Insti. 37:167-175 (1966); Eagle et ah, J. Exp. 

2: Med. 1 3 1 :836-879 (1 970)); Freshney, supra. This is in part due to release of various growth 

tU factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control. 

20 Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells {see, e.g., 
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth. 
25 in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, Tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sem Cancer Biol. (1992)). 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. Also, see, Unkless et al. , J. Biol. Chem. 249:4295-4305 (1974); 
30 Strickland & Beers, J. Biol. Chem. 251:5694-5702 (1976); Whur et al, Br. J. Cancer 42:305- 

79 



Attorney Docket No.:018501-005210US 



312 (1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with 
tumor growth, in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985); 
Freshney Anticancer Res. 5:111-130 (1985). 

Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate breast cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1994), supra, can be used. Briefly, the 
level of invasion of host cells can be measured by using filters coated with Matrigel or some 
other extracellular matrix constituent. Penetration into the gel, or through to the distal side of 
the filter, is rated as invasiveness, and rated histologically by number of cells and distance 
moved, or by prelabeling the cells with 125 I and counting the radioactivity on the distal side of 
the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 

Tumor growth in vivo 

Effects of breast cancer-associated sequences on cell growth can be tested in 
transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which 
the breast cancer gene is disrupted or in which a breast cancer gene is inserted. Knock-out 
transgenic mice can be made by insertion of a marker gene or other heterologous gene into 
the endogenous breast cancer gene site in the mouse genome via homologous recombination. 
Such mice can also be made by substituting the endogenous breast cancer gene with a 
mutated version of the breast cancer gene, or by mutating the endogenous breast cancer gene, 
e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
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that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion (see, e.g., Capecchi et al, Science 244:1288 (1989)). Chimeric targeted mice can be 
derived according to Hogan et al. , Manipulating the Mouse Embryo: A Laboratory Manual, 
5 Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A 
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987). 

Alternatively, various immune-suppressed or immune-deficient host animals 
can be used. For example, genetically athymic "nude" mouse (see, e.g., Giovanella et al, J. 
B Natl. Cancer Inst. 52:921 (1974)), a SCID mouse, a thymectomized mouse, or an irradiated 
rpjO mouse (see, e.g., Bradley et al, Br. J. Cancer 38:263 (1978); Selby et al.,Br. J. Cancer 
jjj" 4 1 :52 (1 980)) can be used as a host. Transplantable tumor cells (typically about 1 0 6 cells) 
-y injected into isogenic hosts will produce invasive tumors in a high proportions of cases, while 

normal cells of similar origin will not. In hosts which developed invasive tumors, cells 
O expressing a breast cancer-associated sequences are injected subcutaneously. After a suitable 
ff}5 length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two 
J; largest dimensions) and compared to the control. Tumors that have statistically significant 
fy reduction (using, e.g., Student's T test) are said to have inhibited growth. 

Polynucleotide modulators of breast cancer 

20 Antisense Polynucleotides 

In certain embodiments, the activity of a breast cancer-associated protein is 
down-regulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., a breast cancer protein mRNA, or a subsequence thereof. 

25 Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring 
subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
30 moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
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sulfur containing species which are known for use in the art. Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the breast cancer protein 
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant 
5 means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 
Antisense molecules as used herein include antisense or sense 
P; oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by 
4 0 binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
yf ; stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
*Z\ (sense) or DNA (antisense) sequences for breast cancer molecules. A preferred antisense 
P molecule is for a breast cancer sequences in Tables 1-25, or for a ligand or activator thereof. 
Ci Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 
jj5 generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
=F ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
p i encoding a given protein is described in, e.g., Stein & Cohen {Cancer Res. 48:2659 (1988 and 
van der Krol et al. {BioTechniques 6:958 (1988)). 

20 Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of breast cancer-associated nucleotide sequences. A ribozyme is an 
RNA molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes 
have been described, including group I ribozymes, hammerhead ribozymes, hairpin 

25 ribozymes, RNase P, and axhead ribozymes {see, e.g., Castanotto et al, Adv. in 

Pharmacology 25: 289-317 (1994) for a general review of the properties of different 
ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et al., 
Nucl. Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent 
30 No. 5,254,678. Methods of preparing are well known to those of skill in the art {see, e.g., 
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WO 94/26877; Ojwang et al, Proc. Natl. Acad. Set USA 90:6340-6344 (1993); Yamada et 
al, Human Gene Therapy 1:39-45 (1994); Leavitt et al.,Proc. Natl. Acad. Sci. USA 92:699- 
703 (1995); Leavitt et al, Human Gene Therapy 5:1151-120 (1994); and Yamada et al, 
Virology 205: 121-126(1994)). 

Polynucleotide modulators of breast cancer may be introduced into a cell 
containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are 
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 
bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of breast 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 
understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating breast cancer in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 
anti-breast cancer antibody that reduces or eliminates the biological activity of an endogenous 
breast cancer protein. Alternatively, the methods comprise administering to a cell or 
organism a recombinant nucleic acid encoding a breast cancer protein. This may be 
accomplished in any number of ways. In a preferred embodiment, e.g. when the breast 
cancer sequence is down-regulated in breast cancer, such state may be reversed by increasing 
the amount of breast cancer gene product in the cell. This can be accomplished, e.g., by 
overexpressing the endogenous breast cancer gene or administering a gene encoding the 
breast cancer sequence, using known gene- therapy techniques, e.g.. In a preferred 
embodiment, the gene therapy techniques include the incorporation of the exogenous gene 
using enhanced homologous recombination (EHR), e.g. as described in PCT/US93/03868, 
hereby incorporated by reference in its entirety. Alternatively, e.g. when the breast cancer 
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sequence is up-regulated in breast cancer, the activity of the endogenous breast cancer gene is 
decreased, e.g. by the administration of a breast cancer antisense nucleic acid. 

In one embodiment, the breast cancer proteins of the present invention may be 
used to generate polyclonal and monoclonal antibodies to breast cancer proteins. Similarly, 
the breast cancer proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify breast cancer 
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 
embodiment, the antibodies are generated to epitopes unique to a breast cancer protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The breast cancer 
antibodies may be coupled to standard affinity chromatography columns and used to purify 
breast cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined 
above, since they will specifically bind to the breast cancer protein. 

Methods of identifying variant breast cancer-associated sequences 

Without being bound by theory, expression of various breast cancer sequences 
is correlated with breast cancer. Accordingly, disorders based on mutant or variant breast 
cancer genes may be determined. In one embodiment, the invention provides methods for 
identifying cells containing variant breast cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous breast cancer genes in a cell. This may be accomplished 
using any number of sequencing techniques. In a preferred embodiment, the invention 
provides methods of identifying the breast cancer genotype of an individual, e.g., determining 
all or part of the sequence of at least one breast cancer gene of the individual. This is 
generally done in at least one tissue of the individual, and may include the evaluation of a 
number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced breast cancer gene to a known breast cancer gene, 
i.e., a wild-type gene. 

The sequence of all or part of the breast cancer gene can then be compared to 
the sequence of a known breast cancer gene to determine if any differences exist. This can be 
done using any number of known homology programs, such as Bestfit, etc. In a preferred 
embodiment, the presence of a difference in the sequence between the breast cancer gene of 
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the patient and the known breast cancer gene correlates with a disease state or a propensity 
for a disease state, as outlined herein. 

In a preferred embodiment, the breast cancer genes are used as probes to 
determine the number of copies of the breast cancer gene in the genome. 

In another preferred embodiment, the breast cancer genes are used as probes to 
determine the chromosomal localization of the breast cancer genes. Information such as 
chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the breast 
cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a breast cancer protein 
or modulator thereof, is administered to a patient. By "therapeutically effective dose" herein 
is meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques (e.g., Ansel et ah, Pharmaceutical Dosage Forms and Drug Delivery; 
Lieberman, Pharmaceutical Dosage Forms (vols. 1-3, 1992), Dekker, ISBN 0824770846, 
082476918X, 0824712692, 0824716981; Lloyd, The Art, Science and Technology of 
Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations (1999)). As is 
known in the art, adjustments for breast cancer degradation, systemic versus localized 
delivery, and rate of new protease synthesis, as well as the age, body weight, general health, 
sex, diet, time of administration, drug interaction and the severity of the condition may be 
necessary, and will be ascertainable with routine experimentation by those skilled in the art. 
U.S. Patent Application N. 09/687,576, further discloses the use of compositions and 
methods of diagnosis and treatment in breast cancer is hereby expressly incorporated by 
reference. 

A "patient" for the purposes of the present invention includes both humans 
and other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
preferably a primate, and in the most preferred embodiment the patient is human. 

85 



Attorney Docket No.:018501-005210US 



The administration of the breast cancer proteins and modulators thereof of the 
present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
some instances, e.g., in the treatment of wounds and inflammation, the breast cancer proteins 
and modulators may be directly applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a breast 
cancer protein in a form suitable for administration to a patient. In the preferred embodiment, 
the pharmaceutical compositions are in a water soluble form, such as being present as 
pharmaceutically acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the 
following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 
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The pharmaceutical compositions can be administered in a variety of unit 
dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that breast cancer protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise a breast cancer 
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, 
sodium lactate and the like. The concentration of active agent in these formulations can vary 
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and 
the like in accordance with the particular mode of administration selected and the patient's 
needs (e.g., Remington s Pharmaceutical Science (15th ed., 1980) and Goodman & Gillman, 
The Pharmacologial Basis of Therapeutics (Hardman et al.,eds., 1996)). 

Thus, a typical pharmaceutical composition for intravenous administration 
would be about 0.1 to 10 mg per patient per day. Dosages from 0. 1 up to about 100 mg per 
patient per day may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 
in the art, e.g., Remington 's Pharmaceutical Science and Goodman and Gillman, The 
Pharmacologial Basis of Therapeutics, supra. 
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The compositions containing modulators of breast cancer proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 
amount sufficient to cure or at least partially arrest the disease and its complications. An 
amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient. In any event, 
the composition should provide a sufficient quantity of the agents of this invention to 
effectively treat the patient. An amount of modulator that is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a "prophylactically effective dose." 
The particular dose required for a prophylactic treatment will depend upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 

It will be appreciated that the present breast cancer protein-modulating 
compounds can be administered alone or in combination with additional breast cancer 
modulating compounds or with other therapeutic agent, e.g., other anti-cancer agents or 
treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in Tables 1-25, such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
methods, reagents, vectors, and cells useful for expression of breast cancer-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
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of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, 
e.g., Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
volume 152 (Berger), Ausubel et al, eds., Current Protocols (supplemented through 1999), 
and Sambrook et al, Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989. 

In a preferred embodiment, breast cancer proteins and modulators are 
administered as therapeutic agents, and can be formulated as outlined above. Similarly, 
breast cancer genes (including both the full-length sequence, partial sequences, or regulatory 
sequences of the breast cancer coding regions) can be administered in a gene therapy 
application. These breast cancer genes can include antisense applications, either as gene 
therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art. 

Breast cancer polypeptides and polynucleotides can also be administered as 
vaccine compositions to stimulate HTL, CTL and antibody responses.. Such vaccine 
compositions can include, e.g., lipidated peptides (see, e.g. , Vitiello, A. et al., J. Clin. Invest. 
95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") 
microspheres (see, e.g., Eldridge, et al, Molec. Immunol. 28:287-294, (1991); Alonso et al, 
Vaccine 12:299-306 (1994); Jones et al, Vaccine 13:675-681 (1995)), peptide compositions 
contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et al, Nature 
344:873-875 (1990); Hu et al, Clin Exp Immunol. 113:235-243 (1998)), multiple antigen 
peptide systems (MAPs) (see, e.g., Tarn, Proc. Natl. Acad. Set U.S.A. 85:5409-5413 (1988); 
Tarn, /. Immunol Methods 196:17-32 (1996)), peptides formulated as multivalent peptides; 
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 
vectors (Perkus, et al, In: Concepts in vaccine development (Kaufmann, ed., p. 379, 1996); 
Chakrabarti, et al, Nature 320:535 (1986); Hu et al, Nature 320:537 (1986); Kieny, et al, 
AIDS Bio/Technology 4:790 (1986); Top et al, J. Infect. Dis. 124:148 (1971); Chanda et al, 
Virology 175:535 (1990)), particles of viral or synthetic origin (see, e.g., Kofler et al, J. 
Immunol. Methods. 192:25 (1996); Eldridge et al, Sem. Hematol. 30:16 (1993); Falo et al, 
Nature Med. 7:649 (1995)), adjuvants (Warren et al, Annu. Rev. Immunol. 4:369 (1986); 
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Gupta et al, Vaccine 1 1:293 (1993)), liposomes (Reddy et al.,J. Immunol. 148:1585 (1992); 
Rock, Immunol. Today 17:131 (1996)), or, naked or particle absorbed cDNA (Ulmer, et al., 
Science 259:1745 (1993); Robinson et al, Vaccine 11:957 (1993); Shiver et al, In: Concepts 
in vaccine development (Kaufrnann, ed., p. 423, 1996); Cease & Berzofsky, Annu. Rev. 
5 Immunol. 12:923 (1994) and Eldridge et al, Sem. Hematol 30:16 (1993)). Toxin-targeted 
delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
p substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 
hSO or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 
W Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 

f 1 ! 

Si as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 

Q MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 

O Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 

pi 5 aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 

J; tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 

z : 

W polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 

Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

20 Vaccines can be administered as nucleic acid compositions wherein DNA or 

RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a 
patient. This approach is described, for instance, in Wolff et. al, Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 

25 include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 

cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
{see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 
invention can be expressed by viral or bacterial vectors. Examples of expression vectors 

30 include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
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vaccinia virus, e.g., as a vector to express nucleotide sequences that encode breast cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
5 Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover et al, Nature 351:456-460 (1991). A wide variety of other vectors useful 
for therapeutic administration or immunization e.g. adeno and adeno-associated virus vectors, 
retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like, 
C will be apparent to those skilled in the art from the description herein {see, e.g., Shata et al. , 
fgO Mol Med Today 6:66-71 (2000); Shedlock et al, JLeukoc Biol 68:793-806 (2000); Hipp et 
J al, In Vivo 14:571-85 (2000)). 

v; Methods for the use of genes as DNA vaccines are well known, and include 

w placing a breast cancer gene or portion of a breast cancer gene under the control of a 
O regulatable promoter or a tissue-specific promoter for expression in a breast cancer patient, 
pi 5 The breast cancer gene used for DNA vaccines can encode full-length breast cancer proteins, 
J:! but more preferably encodes portions of the breast cancer proteins including peptides derived 
ft! from the breast cancer protein. In one embodiment, a patient is immunized with a DNA 

vaccine comprising a plurality of nucleotide sequences derived from a breast cancer gene. For 
example, breast cancer-associated genes or sequence encoding subfragments of a breast 
20 cancer protein are introduced into expression vectors and tested for their immunogenicity in 
the context of Class I MHC and an ability to generate cytotoxic T cell responses. This 
procedure provides for production of cytotoxic T cell responses against cells which present 
antigen, including intracellular epitopes. 

In a preferred embodiment, the DNA vaccines include a gene encoding an 
25 adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the breast cancer polypeptide encoded by the DNA 
vaccine. Additional or alternative adjuvants are available. 

In another preferred embodiment breast cancer genes find use in generating 
animal models of breast cancer. When the breast cancer gene identified is repressed or 
30 diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed 
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to the breast cancer gene will also diminish or repress expression of the gene. Animal models 
of breast cancer find use in screening for modulators of a breast cancer-associated sequence 
or modulators of breast cancer. Similarly, transgenic animal technology including gene 
knockout technology, e.g. as a result of homologous recombination with an appropriate gene 
targeting vector, will result in the absence or increased expression of the breast cancer 
protein. When desired, tissue-specific expression or knockout of the breast cancer protein 
may be necessary. 

It is also possible that the breast cancer protein is overexpressed in breast 
cancer. As such, transgenic animals can be generated that overexpress the breast cancer 
protein. Depending on the desired expression level, promoters of various strengths can be 
employed to express the transgene. Also, the number of copies of the integrated transgene 
can be determined and compared for a determination of the expression level of the transgene. 
Animals generated by such methods find use as animal models of breast cancer and are 
additionally useful in screening for modulators to treat breast cancer. 

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, breast cancer-specific nucleic 
acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 
ribozymes, dominant negative breast cancer polypeptides or polynucleotides, small molecules 
inhibitors of breast cancer-associated sequences etc. A therapeutic product may include 
sterile saline or another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 
(i.e., protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 



92 



Attorney Docket No.:018501-005210US 



like. Such media may include addresses to internet sites that provide such instructional 
materials. 

The present invention also provides for kits for screening for modulators of 
breast cancer-associated sequences. Such kits can be prepared from readily available 
materials and reagents. For example, such kits can comprise one or more of the following 
materials: a breast cancer-associated polypeptide or polynucleotide, reaction tubes, and 
instructions for testing breast cancer-associated activity. Optionally, the kit contains 
biologically active breast cancer protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 
the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 
parameters in disease which may be identified in historical or outcome data. 



EXAMPLES 

Example 1: Tissue Preparation, Labeling Chips, and Fingerprints 

Purifying total RNA from tissue sample usi ng TRIzol Reagent 

The sample weight is first estimated. The tissue samples are homogenized in 
1 ml of TRIzol per 50 mg of tissue using a homogenizer (e.g., Polytron 3100). The size of 
the generator/probe used depends upon the sample amount. A generator that is too large for 
the amount of tissue to be homogenized will cause a loss of sample and lower RNA yield. A 
larger generator (e.g., 20 mm) is suitable for tissue samples weighing more than 0.6 g. Fill 
tubes should not be overfilled. If the working volume is greater than 2 ml and no greater than 
10 ml, a 15 ml polypropylene tube (Falcon 2059) is suitable for homogenization. 

Tissues should be kept frozen until homogenized. The TRIzol is added 
directly to the frozen tissue before homogenization. Following homogenization, the insoluble 
material is removed from the homogenate by centrifugation at 7500 x g for 15 min. in a 

Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf centrifuge at 4°C. The cleared 
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homogenate is then transferred to a new tube(s). Samples may be frozen and stored at -60 to 

-70°C for at least one month or else continue with the purification. 

The next process is phase separation. The homogenized samples are incubated 

for 5 minutes at room temperature. Then, 0.2 ml of chloroform per 1ml of TRIzol reagent is 
5 added to the homogenization mixture. The tubes are securely capped and shaken vigorously 

by hand (do not vortex) for 15 seconds. The samples are then incubated at room temp, for 

2-3 minutes and next centrifuged at 6500 rpm in a Sorvall superspeed for 30 min. at 4oC. 
y : The next process is RNA Precipitation. The aqueous phase is transferred to a 

k£ fresh tube. The organic phase can be saved if isolation of DNA or protein is desired. Then 
fjlO 0.5 ml of isopropyl alcohol is added per 1ml of TRIzol reagent used in the original 
ST '■' homogenization. Then, the tubes are securely capped and inverted to mix. The samples are 
2 ! then incubated at room temp, for 10 minutes an centrifuged at 6500 rpm in Sorvall for 20 

l_ min. at 4°C. 

u. The RNA is then washed. The supernatant is poured off and the pellet washed 

: 1 5 with cold 75% ethanol. 1 ml of 75% ethanol is used per 1 ml of the TRIzol reagent used in 
p the initial homogenization. The tubes are capped securely and inverted several times to 

loosen pellet without vortexing . They are next centrifuged at <8000 rpm (<7500 x g) for 5 

minutes at 4°C. 

The RNA wash is decanted. The pellet is carefully transferred to an 
20 Eppendorf tube (sliding down the tube into the new tube by use of a pipet tip to help guide it 
in if necessary). Tube(s) sizes for precipitating the RNA depending on the working volumes. 
Larger tubes may take too long to dry. Dry pellet. The RNA is then resuspended in an 
appropriate volume (e.g., 2 -5 ug/ul) of DEPC H2O. The absorbance is then measured. 

The poly A+ mRNA may next be purified from total RNA by other methods 
25 such as Qiagen' s RNeasy kit. The poly A + mRNA is purified from total RNA by adding the 
oligotex suspension which has been heated to 37°C and mixing prior to adding to RNA. 
The Elution Buffer is incubated at 70°C. If there is precipitate in the buffer, warm up the 2 x 
Binding Buffer at 65°C. The the total RNA is mixed with DEPC-treated water, 2 x Binding 
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Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook and next 

incubated for 3 minutes at 65°C and 10 minutes at room temperature. 

The preparation is centrifuged for 2 minutes at 14,000 to 18,000 g, preferably, 
at a "soft setting," The supernatant is removed without disturbing Oligotex pellet. A little bit 
5 of solution can be left behind to reduce the loss of Oligotex. The supernatant is saved until 

satisfactory binding and elution of poly A + mRNA has been found. 

Then, the preparation is gently resuspended in Wash Buffer OW2 and pipetted 
iy : onto the spin column and centrifuged at full speed (soft setting if possible) for 1 minute, 
fr; Next, the spin column is transferred to a new collection tube and gently 

If il 0 resuspended in Wash Buffer OW2 and centrifuged as described herein. 
f$\ Then, the spin column is transferred to a new tube and eluted with 20 to 100 ul 

s«i of preheated (70°C) Elution Buffer. The Oligotex resin is gently resuspended by pipetting up 
l_ and down. The centrifugation is repeated as above and the elution repeated with fresh elution 
y buffer or first eluate to keep the elution volume low. 

"j. 15 The absorbance is next read to determine the yield, using diluted Elution 

P Buffer as the blank. 

Before proceeding with cDNA synthesis, the mRNA is precipitated before 
proceeding with cDNA synthesis, as components leftover or in the Elution Buffer from the 
Oligotex purification procedure will inhibit downstream enzymatic reactions of the mRNA. 
20 0.4 vol. of 7.5 M NEMO Ac + 2.5 vol. of cold 1 00% ethanol is added and the preparation 
precipitated at -20°C 1 hour to overnight (or 20-30 min. at -70°C), and centrifuged at 
14,000-16,000 x g for 30 minutes at 4°C. Next, the pellet is washed with 0.5 ml of 80% 
ethanol (-20°C) and then centrifuged at 14,000-16,000 x g for 5 minutes at room temperature. 
The80% ethanol wash is then repeated. The last bit of ethanol from the pellet is then dried 
25 without use of a speed vacuum and the pellet is then resuspended in DEPC H2O at lug/ul 
concentration. 



Alternatively the RNA may be purified using other methods (e.g.. Oiagen's RNeasy kit) 
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No more than 100 ug is added to the RNeasy column. The sample volume is 
adjusted to 1 00 ul with RNase-free water. 3 50 ul Buffer RLT and then 250 ul ethanol 
(100%) are added to the sample. The preparation is then mixed by pipetting and applied to an 
RNeasy mini spin column for centrifugation (15 sec at >1 0,000 rpm). If yield is low, reapply 
5 the fiowthrough to the column and centrifuge again. 

Then, transfer column to a new 2 ml collection tube and add 500 ul Buffer 
RPE and centrifuge for 15 sec at >1 0,000 rpm. The fiowthrough is discarded. 500 ul Buffer 
RPE and is then added and the preparation is centriuged for 15 sec at >1 0,000 rpm. The 

fgsst: 

Q fiowthrough is discarded, and the column membrane dried by centrifuging for 2 min at 
||0 maximum speed. The column is transferred to a new 1.5 -ml collection tube. 30-50 ul of 

RNase-free water is applied directly onto column membrane. The column is then centrifuged 

I y 

'%! for 1 min at > 10,000 rpm and the elution step repeated. 

w The absorbance is then read to determine yield. If necessary, the material may 

? 

© be ethanol precipitated with ammonium acetate and 2.5X volume 100% ethanol. 

m 

j: First Strand cDNA Synthesis 

fll The first strand can be make using using Gibco's "Superscript Choice System 

for cDNA Synthesis" kit. The starting material is 5 ug of total RNA or 1 ug of polyA+ 
mRNAl. For total RNA, 2 ul of Superscript RT is used; for polyA+ mRNA, 1 ul of 
20 Superscript RT is used. The final volume of first strand synthesis mix is 20 ul. The RNA 
should be in a volume no greater than 10 ul. The RNA is incubated with 1 ul of 100 pmol 
T7-T24 oligo for 10 min at 70°C followed by addition on ice of 7 ul of: 4ul 5X 1 st Strand 
Buffer, 2 ul of 0.1M DTT, and 1 ul of lOmM dNTP mix. The preparation is then incubated at 
37°C for 2 min before addition of the Superscript RT followed by incubation at 37°C for 1 
25 hour. 

Second Strand Synthesis 

For the second strand synthesis, place 1st strand reactions on ice and add: 91 

ul DEPC H 2 0; 30 ul 5X 2nd Strand Buffer; 3 ul lOmM dNTP mix; 1 ul 10 U/ul E.coli DNA 

30 Ligase; 4 ul 10 U/ul E.coli DNA Polymerase; and 1 ul 2 U/ul RNase H. Mix and incubate 2 
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hours at 16°C. Add 2 ul T4 DNA Polymerase. Incubate 5 min at 16°C. Add 10 ul of 0.5M 
EDTA. 

Cleaning up cDNA 

The cDNA is purified using Phenol:Chloroform:Isoamyl Alcohol (25 :24: 1) 
and Phase-Lock gel tubes. The PLG tubes are centrifuged for 30 sec at maximum speed. 
The cDNA mix is then transferred to PLG tube. An equal volume of 
phenol :chloroform:isamyl alcohol is then added, the preparation shaken vigorously (no 
vortexing), and centrifuged for 5 minutes at maximum speed. The top aqueous solution is 
transferred to a new tube and ethanol precipitated by adding 7.5X 5M NH40Ac and 2.5X 
volume of 100% ethanol. Next, it is centrifuged immediately at room temperature for 20 
min, maximum speed. The supernatant is removed, and the pellet washed with 2X with cold 
80% ethanol. As much ethanol wash as possible should be removed before air drying the 
pellet; and resuspending it in 3 ul RNase-free water. 

In vitro Transcription (TVT) and labeling with biotin 

In vitro Transcription (IVT) and labeling with biotin is performed as follows: 
Pipet 1.5 ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2 ul 
T7 lOxATP (75 mM) (Ambion); 2 ul T7 lOxGTP (75 mM) (Ambion); 1.5 ul T7 lOxCTP (75 
mM) (Ambion); 1.5 ul T7 lOxUTP (75 mM) (Ambion); 3.75 ul 10 mM Bio-ll-UTP 
(Boehringer-Mannheim/Roche or Enzo); 3.75 ul 10 mM Bio-16-CTP (Enzo); 2 ul lOx T7 
transcription buffer (Ambion); and 2 ul lOx T7 enzyme mix (Ambion). The final volume is 
20 ul. Incubate 6 hours at 37°C in a PCR machine. The RNA can be furthered cleaned. 
Clean-up follows the previous instructions for RNeasy columns or Qiagen's RNeasy protocol 
handbook. The cRNA often needs to be ethanol precipitated by resuspension in a volume 
compatible with the fragmentation step. 

Fragmentation is performed as follows. 15 ug of labeled RNA is usually 
fragmented. Try to minimize the fragmentation reaction volume; a 10 ul volume is 
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in 
the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment 
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RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 
buffer is 200 mM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc). The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65 °C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea 
5 of the transcript size range 

For hybridization, 200 ul (10 ug cRNA) of a hybridization mix is put on the 
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it 
ij is recommended that an initial hybridization mix of 300 ul or more be made. The 
jr hybridization mix is: fragment labeled RNA (50 ng/ul final cone); 50 pM 948-b control 

Iks; 1 

yiO oligo; 1 .5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; 0. 1 mg/ml herring sperm DNA; 

ST; 0.5 mg/ml acetylated BSA; and 300 ul with lxMES hyb buffer. 

^ The hybridization reaction is conducted with non-biotinylated IYT (purified 

s by RNeasy columns) (see example 1 for steps from tissue to IVT): The following mixture is 

g prepared: 

*|jl5 IVT antisense RNA; 4 ug: pi 

O Random Hexamers (1 pg/pl): 4 ul 

m j 

H 2 0: ul _ 

14 ul 

Incubate the above 14 pi mixture at 70°C for 10 min.; then put on ice. 
20 The Reverse transcription procedure uses the following mixture: 

0.1MDTT: 3 ul 

50X dNTP mix: 0.6 ul 

H 2 0: 2.4 ul 

Cy3 or Cy5 dUTP (ImM): 3 pi 
25 SS RT n (BRL): 1 pi 

16 pi 

The above solution is added to the hybridization reaction and incubated for 30 min., 42°C. 

Then, 1 pi SSII is added and incubated for another hour before being placed on ice. 

98 



Attorney Docket No.:018501-005210US 



The 50X dNTP mix contains 25mM of cold dATP, dCTP, and dGTP, lOmM 
of dTTP and is made by adding 25 ul each of lOOmM dATP, dCTP, and dGTP; 10 ul of 
lOOmM dTTP to 15 ul H 2 0. ] 

RNA degradation is performed as follows. Add 86 ul H 2 0, 1.5 ul 1M NaOH/ 
2 mM EDTA and incubate at 65°C, 10 min.. For U-Con 30, 500 ul TE/sample spin at 7000 g 
for 10 min, save flow through for purification. For Qiagen purification, suspend u-con 
recovered material in 500 ul buffer PB and proceed using Qiagen protocol. For DNAse 
digestion, add 1 ul of 1/100 dilution of DNAse/30 ul Rx and incubate at 37°C for 15 min. 
Incubate at 5 min 95 °C to denature the DNAse. 

Sample preparation 

For sample preparation, add Cot-1 DNA, 10 ul; 50X dNTPs, 1 ul; 20X SSC, 2.3 ul; Napyro 
phosphate, 7.5 ul; 10 mg/ml Herring sperm DNA; 1 ul of 1/10 dilution to 21.8 final vol. Dry 
in speed vac. Resuspend in 15 ul H 2 0. Add 0.38 ul 10% SDS. Heat 95°C, 2 min and slow 
cool at room temp, for 20 min. Put on slide and hybridize overnight at 64°C. Washing after 
the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 ml 20X SSC+0.75ml 10% SDS in 
250ml H 2 0; IX SSC: 5 min., 12.5 ml 20X SSC in 250ml H 2 0; 0.2X SSC: 5 min., 2.5 ml 20X 
SSC in 250ml H 2 0. Dry slides and scan at appropiate PMT's and channels. 
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TABLE 1: Figure 1 from BRCA 001 US 

Table 1 shows genes, (incorporated in their entirety here and throughout the application 
where primekeys are provided), downregulated in tumor tissue compared to normal breast 
tissue. 



Pkey: 
ExAccn: 
UnkjenelD: 
Unigene Title: 
R1: 



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 



Ratio of normal breast tissue to tumor 



Pkey ExAccn UnigenelD UnigeneTHIe R1 

100472 D90084 Hs.1023 pyruvate dehydrogenase (lipoamide) alpha 5 

100499 T51986 Hs.283108 hemoglobin, gamma G 10 

100545 M55405 gb:Homo sapiens mucin (MUC-3) mRNA, part 5 

100549 BE142019 Hs.222056 Homo sapiens CDNAFLJ1 1572 fis, clone HE 10 

100613 X52078 Hs.101047 transcription factor 3 (E2A immunoglobul 5 

100635 BE259039 Hs.129953 Ewing sarcoma breakpoint region 1 5 

100645 X16841 Hs.167988 neural cell adhesion molecule 1 5 

100654 A03758 NM_000477*:Homo sapiens albumin (ALB), m 10 

100702 L27065 gb:Human neurofibromatosis 2 (NF2) mRNA, 5 

100915 M60832 Hs.249239 collagen, type VIII, alpha 2 5 

100971 BE379727 Hs.83213 fatty acid binding protein 4, adipocyte 10 

101125 AJ250562 Hs.82749 transmembrane 4 superfamily member 2 5 

101166 M90424 Hs.2099 lipocalin 1 (protein migrating faster th 5 

101184 NM_001674 Hs.460 activating transcription factor 3 10 

101336 NM_006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 10 

101367 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 10 

101447 M21305 gb:Human alpha satellite and satellite 3 10 

101461 N98569 Hs.76422 phospholipase A2, group IIA (platelets, 10 

101511 M27826 Hs.267319 endogenous retroviral protease 10 

101634 AV650262 Hs.75765 GR02 oncogene 5 

101736 M74447 Hs.502 transporter 2, ATP-binding cassette, sub 10 

102208 U22961 gb:Human mRNA clone with similarity to L 10 

102297 NM_001504 Hs.198252 G protein-coupled receptor 9 5 

102450 U48251 Hs.75871 protein kinase C binding protein 1 10 

102515 U89337 Hs.169886 tenascinXB 10 

102571 U60115 Hs.239069 four and a half LIM domains 1 5 

102800 AA313538 gb:EST185419 Colon carcinoma (HCC) cell 10 

102857 NM_006744 Hs.76461 retinol-binding protein 4, interstitial 10 

102990 AA829286 Hs.332053 serum amyloid A1 10 

103434 X98085 Hs.54433 tenascin R (restrictin, janusin) 5 

103747 AA081 995 gb:zn26d06,r1 Stratagene neuroepithelium 10 

103750 AA126129 gb:zm78c07.r1 Stratagene neuroepithelium 5 

103812 AA137107 Hs.326391 Homo sapiens, clone MGC:1 6638, mRNA, com 10 

103851 AA326216 Hs.8719 hypothetical protein MGC1 136 5 

104080 AB041036 Hs.57771 kallikrein 1 1 (KLK11; TLSP; PRSS20; hipp 5 

104093 R50727 Hs.336970 ESTs 10 

104106 M422123 gb:zv26h12.M Soares_NhHMPu_S1 Homo sapi 5 

104109 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 10 

104250 F06638 Hs.12440 Homo sapiens clone 24734 mRNA sequence 10 

104340 M426189 gb:zw11e09.r1 Soares_NhHMPu_S1 Homo sapi 5 

104492 N73185 Hs.94285 EST 10 

104506 N91071 Hs.109650 ESTs 10 

104511 N99542 Hs.572 orosomucoid 1 5 

104532 AI498763 Hs.203013 hypothetical protein FLJ12748 10 
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Hs.1 58725 


ESTs 


10 


108827 


AI273692 


Hs.1 10470 


ESTs 


10 


109123 


AI028376 


Hs.73232 


ESTs 


10 
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15 



m 



: 25 



35 



40 



45 



50 
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109389 
109546 
109919 
110006 
110141 
110354 
110433 
110448 
110455 
110540 
110553 
110976 
110987 
111158 
111168 
111187 
111307 
111400 
111498 
111651 
111738 
111803 
111995 
112071 
112204 
112258 
112490 
112588 
112654 
112784 
112817 
112885 
112913 
113149 
113174 
113203 
113299 
113367 
113457 
113563 
113574 
113776 
113790 
113807 
113958 
114211 
114254 
114349 
114449 
114484 
114576 
114624 
114844 
114906 
115624 
115666 
115712 
115889 
115949 
116107 
116180 
116267 
116291 
116527 
116659 
116708 
117058 



AA101325 

F01449 

R40604 

AI094674 

H46749 

W22165 

AW294162 

H51276 

H52576 

H72639 

H60593 

AL044174 

AI753316 

N66616 

AI798376 

AJ224864 

AA641636 

R00144 

AI168511 

R16733 

R26065 

AA593731 

R42333 

AL1 17490 

NM_006668 

R51889 

R31094 

R77302 

BE618629 



AI057205 

AA581428 

T16837 

T51588 

T54659 

AA743563 

AW207424 

N92359 

R16763 

AA913635 

R06874 

AI791905 

AI244311 

W07586 

W86195 

Z39319 

AB018263 

AA745978 

AA020736 

AA034378 

AA065096 

AA081507 

AA234826 

AA234462 

AK000725 

AF1 73081 

AB020649 

AA398841 

AI478427 

AL133916 

AA463902 

AW968703 

AW410377 

AW194253 

BE314852 

F10528 

AW801806 



Hs.86154 

Hs.26954 

Hs.1 29539 

Hs.30524 

Hs.31540 

Hs.22586 

Hs.301062 

Hs.13526 

Hs.167608 

Hs.124990 

Hs.159526 

Hs.26034 

Hs.138629 

Hs.9688 

Hs.37477 

Hs.189771 

Hs.20499 

Hs.325823 

Hs.302292 

Hs.47225 

Hs.25121 

Hs.24990 

Hs.24378 

Hs.268809 

Hs.191290 

Hs.14584 

Hs.5021 

Hs.4241 

Hs.301755 

Hs.10305 

Hs.332594 

Hs.14518 

Hs.268679 

Hs.326413 

Hs.268628 

Hs.95549 

Hs.26912 

Hs.8045 

Hs.27347 

Hs.180338 

Hs.28273 

Hs.267319 



Hs.87386 

Hs.87350 

Hs.50579 

Hs.178215 

Hs.74569 

Hs.39850 

Hs.43125 

Hs.1 72572 

Hs.13522 

Hs.30085 

Hs.41502 

Hs.68607 

Hs.1 68694 

Hs.70001 



hypothetical protein FLJ12457 10 

Homo sapiens mRNA; cDNA DKFZp762G123 (fr 5 

ESTs, Weakly similar to MCATJHUMAN MITOC 1 0 

ring finger protein 24 10 

ESTs 10 

ESTs 5 

UDP-N-acetyl-alpha-D-galactosamine:polyp 1 0 

hypothetical protein FLJ12688 1 0 

gb:yt85e08.rl Soares_pineal_giand_N3HPG 5 

ESTs 5 

ESTs 10 

patched (Drosophila) homolog 1 0 

ESTs 5 

H .sapiens mRNA for subtelomeric repeat s 5 

gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens 10 

leukocyte membrane antigen 5 

ESTs, Weakly similar to T46908 hypotheti 5 

ESTs 10 

gb:ow90h09.s1 Soares_fetal_liver_spleen_ 10 

ESTs 10 

gb:yh39d03.s1 Soares placenta Nb2HP Homo 5 

ESTs, Moderately similar to ALU5JHUMAN A 10 

ESTs 10 

Ras-associated protein Rap1 10 

cytochrome P450, subfamily 46 (cholester 1 0 

ESTs 5 

ESTs 10 

gb:yi75h08.s1 Soares placenta Nb2HP Homo 1 0 

ESTs 5 

ESTs 5 

ESTs 5 

EST 10 

ESTs 5 

gb:yb27e06.s1 Stratagene fetal spleen (9 1 0 

Homo sapiens cDNA FLJ1 1465 fis, clone HE 5 

ESTs 5 

ESTs 10 

ESTs, Moderately similar to A48752 B-cel 10 

ESTs 5 

Homo sapiens cDNA FLJ20812 fis, clone AD 10 

ESTs, Moderately similar to ALU1_HUMAN A 5 

hypothetical protein 10 

ESTs 10 

ESTs 3 

gb:zh54e05.s1 Soares_fetal_liver_spleen_ 10 

EST 10 

tumor necrosis factor receptor superfami 5 

ESTs 5 

gb:ze63b1 1 .s1 Soares retina N2b4HR Homo 5 

endogenous retroviral protease 5 

gb:zm50a02.s1 Stratagene fibroblast (937 5 

gb:zn05b10.M Stratagene hNT neuron (937 5 

EST 5 

ESTs 5 

hypothetical protein FLJ20718 3 

Vertebrate LIN7 homolog 1 , Tax interact! 5 

KIAA0842 protein 5 

hypothetical protein FLJ20517 10 

esophageal cancer related gene 4 protein 10 

hypothetical protein FLJ20093 10 

ESTs, Weakly similar to 138022 hypotheti 5 

hypothetical protein FLJ23186 5 

hypothetical protein FLJ21276 5 

ESTs 10 

Homo sapiens clone 23763 unknown mRNA, p 5 

ESTs, Moderately similar to JC6169 nude 5 

gb:IL5-UM0070-1 10400-062-g07 UM0070 Homo 5 
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10 



15 



So 



55 

c 

jp=l50 



40 



45 



50 



55 



60 



65 



117151 
117226 
117323 
117571 
117624 
117673 
117847 
117877 
117919 
118049 
118413 
118613 
118664 
118858 
118902 
119039 
119159 
119216 
119317 
119366 
119378 
119528 
119792 
119800 
119817 
119835 
119923 
119961 
120379 
120931 
121037 
121282 
121382 
121764 
122034 
122441 
122756 
122771 
123601 
123623 
123941 
124215 
124276 
124680 
125099 
125121 
125188 
125284 
125906 
128484 
128511 
128538 
128606 
128850 
128870 
128903 
128931 
129001 
129091 
129101 
129146 
129213 
129228 
129265 
129285 
129346 
129368 



AI803656 

N20468 

AI472863 

N34417 

N26627 

N40551 

N49285 

AW263476 

BE222341 

N53145 

AW955696 

AI078236 

N70907 

AL122040 

AA993527 

AI1 60570 

AF142419 

AA514422 

AK002001 

T77892 

T81824 

W38051 

AL049798 

AF086332 

AF088061 

AF086429 

AW803308 

U34249 

AL042725 

AW136934 

AA907743 

AA401695 

AA405763 

AA421452 

AK000229 

AA447555 

AA458945 

AW135093 

AA609122 

AI024595 

AA621529 

H62570 

H83465 

AK001527 

NM 014312 

T98199 

BE299567 

NM.002666 

BE256206 

AA485421 

NM.002250 

R44214 

C16161 

AA193106 

H39537 

AW150717 

N62889 

AA443323 

AA056483 

NM.013403 

AL1 17472 

AI146494 

U40714 

AA530892 

BE617015 

AF1 10141 

NM_003877 



Hs.42373 

Hs.43387 

Hs.44584 

Hs.82364 

Hs.184043 

Hs.182391 

Hs.44268 

Hs.279472 

Hs.90960 

Hs.49688 

Hs.230619 

Hs.102981 

Hs.293907 

Hs.252097 

Hs.15020 

Hs.221849 

Hs.51305 

Hs.90949 

Hs.80552 

Hs.58314 

Hs.159690 

Hs.58429 

Hs.62954 

Hs.337461 

Hs.97162 

Hs.142373 

Hs.97334 

Hs.111939 

Hs.164851 

Hs.98017 

Hs.99116 

Hs.95898 

Hs.97282 

Hs.112645 

Hs.97508 



Hs.163953 

Hs.112377 

Hs.48403 

Hs.271749 

Hs.103253 

Hs.17775 

Hs.270503 

Hs.10082 

Hs.101189 

Hs.283040 

Hs.180817 

Hs.75309 

Hs.296176 

Hs.107242 

Hs.107812 

Hs.301463 

Hs.108665 

Hs.108924 

Hs.109525 

Hs.239307 

Hs.171695 

Hs.11006 

Hs.288908 

Hs.1 10776 



ESTs 5 

gb:yx39b1 0.s1 Soares melanocyte 2NbHM Ho 1 0 

ESTs 5 

ESTs 3 

ESTs, Weakly similar to JC4124 pregnancy 5 

Homo sapiens Ets-1 binding protein (E1B) 10 

ESTs 10 

myelin gene expression factor 2 1 0 

ESTs 5 

gb:yv55f09.s1 Soares fetal liver spleen 3 

ESTs 10 

ESTs 5 

EST 10 

Homo sapiens mRNA; cDNA DKFZp434G1972 (f 3 

hypothetical protein FLJ23403 5 

pregnancy specific beta-1 -glycoprotein 6 3 

homolog of mouse quaking QKI (KH domain 5 

ESTs 5 

v-maf muscu loaponeurotic fibrosarcoma (a 1 0 

gb:yd20f04.s1 Soares fetal liver spleen 5 

EST 5 

Empirically selected from AFFX single pr 10 

dermatopontin 3 

ESTs 10 

ESTs 5 

ESTs 5 

ferritin, heavy polypeptide 1 5 

Human putative zinc finger protein (ZNFB 5 

gb:DKFZp434B1 822_r1 434 (synonym: htes3) 1 0 

ESTs 5 

ESTs 5 

ESTs 5 

Homo sapiens cDNA FLJ20470 fis, clone KA 5 

ESTs, Weakly similar to KIAA0926 protein 5 

Homo sapiens cDNA FLJ20222 fis, clone CO 10 

EST 10 

ESTs 10 

ESTs, Highly similar to G100JHUMAN 1 10 K 5 

Homo sapiens mRNA; cDNA DKFZp434D2472 (f 5 

a disintegrin and metalloproteinase doma 5 

gb:af47a02.s1 Soares_total_fetus_Nb2HF8_ 10 

gb:yr44a01 .rl Soares fetal liver spleen 5 

gb:ys91a11.s1 Soares retina N2b5HR Homo 5 

hypothetical protein FU10665 5 

cortic al thymocyte receptor (X. laevis 1 0 

hypothetical protein FLJ10847 10 

ESTs, Moderately similar to ALU8_HUMAN A 5 

perilipin 10 

p75NTR-associated cell death executor o 5 

ESTs, Weakly similar to ALU7.HUMAN ALUS 10 

potassium intermediate/small conductance 10 

ESTs 5 

hypothetical protein PR02543 5 

chromosome 11 open reading frame 23 10 

eukaryotic translation elongation factor 1 0 

STAT induced STAT inhibitor 3 10 

Homo sapiens cDNA FLJ12965 fis, clone NT 10 

BPOZ protein 5 

Human Chromosome 1 6 BAC clone CIT987SK-A 5 

zinedin 10 

SH3-domain protein 5 (ponsin) 5 

ESTs, Weakly similar to IRX2_HUMAN IROQU 3 

tyrosyl-tRNA synthetase 5 

dual specificity phosphatase 1 5 

ESTs, Moderately similar to T1 7372 plasm 10 

WAS protein family, member 2 10 

STAT induced STAT inhibitor-2 5 
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129371 


X06828 




129381 


AW245805 




129440 


W37944 




129441 


BE061069 


5 


129516 


AF020038 




129554 


BE222078 




129684 


BE622468 




129702 


AI304966 


10 


129778 


AK001676 


129893 


AK000956 




129928 


AI338993 




129973 


AJ251760 




129977 


NM_000399 


15 


130014 


NM_001158 


130085 


M62402 


3 B 

-. - - 

■ 


130089 


AA452006 




130162 


W80711 


Iks' 


130243 


D88435 




130315 


AI241084 




130339 


AA435746 


U I 

rr- 


130400 


V00517 




130436 


mm 001928 


H r. : 

: 


130478 


X72308 


A 5 


130480 


BE222978 


130494 


MVVOoUOOt 


130563 


BE270472 






AL1 10226 


i~ 

:'" : 


130606 


AI652143 






AI7fiQf)R7 

1 / UcJUU i 


NlO 


130683 






130689 


mm 006691 




130716 


AA232075 


130718 


AF263462 




130722 


N41322 


rf 5 


130798 


M81349 




130840 


BE048821 




131184 


AB040935 




131261 


AA360419 


40 


131282 


X03350 


131328 


AW9392"S1 




131340 


AK000393 




131341 


AF1 10908 




131406 


H83294 


45 


131489 


BE394648 


131543 


AWQRR881 




131692 


BE559681 




131753 








AA44^Qfifi 


50 


131785 


H69342 


IO IO I o 


AAn919*ift 




131810, 

IO IO 


RF?44Qfi1 




131828 


a mnnpfi^ 

nJUUU^OO 




131888 


AW294659 


55 


131927 


AJ003112 


131949 


AK000010 




132115 


H81604 




132177 


X80818 




132296 


AA467752 


60 


132426 


AW1 18072 


132477 


S68874 




132675 


AI291496 




132796 


NM 006283 




132898 


W28548 


65 


132905 


NM 004235 


132953 


BE175645 




133116 


BE563966 




133120 


NM 003278 



Hs.1 10802 von Willebrand factor 

Hs.1 10903 claudin 5 (transmembrane protein deleted 

Hs.4007 Sarcolemmal-associated protein 

Hs.301943 KIAA0467 protein 

Hs.1 1223 isocitrate dehydrogenase 1 (NADP+), solu 

Hs.1 13069 ESTs 

Hs.1 1924 ESTs, Weakly similar to I38022 hypotheti 
Hs.12035 ESTs, Weakly similar to I38022 hypotheti 
Hs.12457 hypothetical protein FLJ10814 
Hs.13209 hypothetical protein FLJ10094 
Hs.134535 ESTs 

Hs.273385 guanine nucleotide binding protein (G pr 
Hs.1395 early growth response 2 (Krox-20 (Drosop 
Hs. 1431 02 amine oxidase, copper containing 2 (reti 
Hs.274313 insulin-like growth factor binding prate 
Hs.333199 ESTs 

Hs.319946 Homo sapiens mRNA for KIAA1727 protein, 

Hs. 153227 cyclin G associated kinase 

Hs.154353 nonselective sodium potassium/proton exc 

gb:zt79e03.s1 Soares_testis_NHT Homo sap 
Hs.283108 hemoglobin, gamma G 
Hs.155597 D component of complement (adipsin) 
Hs.251526 small inducible cytokine A7 (monocyte ch 
Hs.15760 MYG1 protein 
Hs.75874 pregnancy-associated plasma protein A 
Hs.279900 HSPC015 protein 
Hs.16441 DKFZP434H204 protein 
Hs.288382 hypothetical protein FLJ131 1 1 
Hs.127824 ESTs, Weakly similar to T28770 hypotheti 
Hs.17872 Homo sapiens, clone IMAGE:3875012, mRNA 
Hs.17917 extracellular link domain-containing 1 
Hs.18259 XPA binding protein 1; putative ATP(GTP) 
Hs.18376 KIAA1319 protein 
Hs.18441 ESTs 

Hs.1 955 serum amyloid A4, constitutive 
Hs.20144 small inducible cytokine subfamily A (Cy 
Hs.23954 cerebral cell adhesion molecule 
Hs.171776 inositol(myo)-1(or4)-monophosphatase 1 
Hs.4 alcohol dehydrogenase 1B (class I), beta 
Hs.25647 v-fos FBJ murine osteosarcoma viral onco 
Hs.25817 BTB (POZ) domain containing 2 
Hs.297660 TNF receptor-associated factor 3 
Hs.284122 Wnt inhibitory factor-1 
Hs.27414 hypothetical protein 
Hs.41639 programmed cell death 2 
Hs.30736 KIAA0124 protein 
Hs.332053 serum amyloid A1 
Hs.31595 ESTs 
Hs.26320 TRABID protein 
Hs.32753 ESTs 
Hs.173103 FE65-UKE 2 
Hs.278658 keratin, hair, basic, 6 (monilethrix) 
Hs.34054 Homo sapiens cDNA: FLJ22488 fis, clone H 
Hs.34780 doublecortex; lissencephaly, X-linked (d 
Hs.258798 hypothetical protein FU20003 
Hs.178471 KIAA0798 gene product 
Hs.178078 glutamate receptor, metabotropic 4 
Hs.195161 ESTs 

Hs.89981 diacylglycerol kinase, zeta (104kD) 
Hs.170917 prostaglandin E receptor 3 (subtype EP3) 
Hs.5476 Homo sapiens, clone IMAGE:35301 23, mRNA, 
Hs.173159 transforming, acidic coiled-coil contain 
Hs.224829 ESTs 

Hs.7934 Kruppel-like factor 4 (gut) 
Hs.321264 LBP protein 32 
Hs.6529 ESTs, Weakly similar to I78885 serine/th 
Hs.65424 tetranectin (plasminogen-binding protein 
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133139 AF052138 Hs.6580 Homo sapiens cDNA: FLJ23227 fis, clone C 5 

133163 AA668224 Hs.6634 Homo sapiens cDNA: FLJ22547 fis, clone H 5 

133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2JHUMAN FORKH 5 

133272 NM_002776 Hs.69423 kallikrein 10 (KLK10) (PRSSL1) (nes1) 5 

5 133379 AA207059 gb:zq80h09.s1 Stratagene hNT neuron (937 5 

133407 AF017987 Hs.7306 secreted frizzled-related protein 1 5 

133552 H21497 Hs.7471 BBP-like protein 1 5 

133702 L02321 Hs.75652 glutathione S-transferase M5 5 

133719 H26904 Hs.75736 apolipoprotein D 5 

10 133731 N71725 Hs.272572 hemoglobin, alpha 2 10 

133789 T85626 Hs.76239 hypothetical protein FLJ20608 5 

134007 AF072441 Hs.7840 calcineurin binding protein 1 10 

134055 D86062 Hs.182423 ES1 (zebrafish) protein, human homobg o 10 

134111 AI372588 Hs.8022 TU3A protein 10 

15 134117 AA081846 Hs.7921 Homo sapiens mRNA; cDNA DKFZp566E1 83 (fr 10 

134177 BE243319 Hs.79672 KIAA0652 gene product 5 

I s * 134308 AW905827 Hs.81454 ketohexokinase (fructokinase) 10 

Ci 134361 BE549343 Hs.82208 acyl-Coenzyme A dehydrogenase, very long 5 

134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( " 5 

! ; f0 134449 L34155 Hs.83450 laminin, alpha 3 (nicein (150kD), kalini 5 

Hi 134467 AI190413 Hs.8373 ESTs 10 

ffi 134496 M64936 gb:Homo sapiens ret'noic acid-inducible 10 

ST; 134510 NMJ02757 Hs.250870 mitogen-actvated protein kinase kinase 10 

134550 M26315 Hs.85258 CD8 antigen, alpha polypeptide (p32) 10 

~-Q,5 134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor 5 

f"\ 134591 U73394 Hs.1 66085 killer cell immunoglobulin-like receptor 5 

134678 AL008583 Hs.1 82595 dynein, axonemal, light polypeptide 4 5 

134728 D10216 Hs.89394 POU domain, class 1, transcription facto 5 

Q 134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, plas 10 

ijO 134786 T29618 Hs.89640 TEK tyrosine kinase, endothelial (venous 10 

1 , 134912 T87521 Hs.261457 ESTs 5 

.H= ! 134963 NM_003394 Hs.91985 wingless-type MMTV integration site fami 10 

£.. 134969 H22570 Hs.172572 hypothetical protein FLJ20093 5 

==: 135001 AA302517 Hs.92732 KIAA1 444 protein 5 

'3.5 135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 10 

iy 135173 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 10 

135197 U76456 Hs.190787 tissue inhibitor of metalloproteinase 4 5 

135219 AB002361 Hs.96633 KIAA0363 protein 5 

135250 U83171 Hs.97203 small inducible cytokine subfamily A (Cy 5 

40 135304 AA416829 Hs.191597 ESTs 5 

135337 AA905406 Hs.9905 ESTs, Weakly similar to unnamed protein 3 

135417 X55019 Hs.99975 cholinergic receptor, nicotinic, delta p 10 

101367 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 5 

128870 H39537 Hs.75309 eukaryotic translation elongation factor 5 

45 129381 AW245805 Hs.1 10903 claudin 5 (transmembrane protein deleted 5 

130085 M62402 Hs.274313 insulin-like growth factor binding prate 5 

130689 NM_006691 Hs.17917 extracellular link domain-containing 1 10 

133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 3 

133407 AF017987 Hs.7306 secreted frizzled-related protein 1 5 

50 133731 N71725 Hs.272572 hemoglobin, alpha 2 5 

134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( 5 

135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 10 

135173 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 5 

322580 AK001852 Hs.274151 ligatin 5 

55 408790 AW580227 Hs.47860 neurotrophic tyrosine kinase, receptor, type 2 10 

418043 AW377752 Hs.83341 AXL receptor tyrosine kinase 5 

427458 BE208364 Hs.29283 ESTs, Weakly similar to LKHU proteoglycan link 5 

446674 AA563892 Hs.306000 solute carrier family 4 (anion exchanger), memb 10 

449826 U85642 Hs.1 38506 ESTs 5 

60 RC_H15814_s Human apM1 mRNA for GS3109 (novel adipose specific collagen 10 

YEL024w7RIP1 EST -YEL024w/RIP1 3 
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TABLE 1A 

Table 1 A shows the accession numbers for those pkeys lacking unigenelD's for Table 1. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 



CAT Number Accessions 



108446 112224J M085383 AA126091 M074174 M075373 M079120M070831 AA075978 AA075372 M1 28503 
108497 110079_2 M074897 M1 13914 AA064871 AA079329 M071309AA084710 AA129030 AA075042 M074794 AA071453 AA078803 
AA148628 AA122204 AA074159 AA126185 M079117 M127089 AA07091 2 AA079280 M131372 AA078833 AA071087 
AA076131 AA071047 M079401 AA083070 AA102076 AA1 1 51 63 M0741 98 AA 134725 AA1 13889 AA1 21 1 03 AA075041 
AA065148 AA071310 AA101 144 AA079659 M078931 M079209 AA070928 AA068994 AA06981 7 AA0761 87 AA069053 
M131489 AA071308 M063317 AA070156 AA071430 AA076056 AA075684 AA070053 AA126283 AA126078 AA075895 
AA079208 AA074583 AA071086 AA079623 AA070627 M078802 M076622 AA065051 M079143 AA071 1 10 AA079434 
AA148748 AA079230 AA085188 AA074485 AA070580 AA076151 AA083166 AA0851 1 8 AA079450 M085044 AA120938 
AA079200 AA100188 AA081472 AA122355 AA129031 M085362 AA069220 AA070940 AA075968 AA074563 AA084027 
AA1 15929 

124215 1597154J H62570 H59063 

1 1 7058 1 21 9924J AW801 806 H90434 BE086530 

110455 46874J H52576 AF085971 H52172 

111168 38585J AI798376 S46400AW811617AW811616W00557BE142245AW858232AW861851 AW858362 AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 T61139 M149776 M699829 AW879188 AW813567 AW813538 AI267168 
AA157718 AA157719 AA100472 AA100774 M130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 
C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 
H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 
AW817659BE081531 H59570 

111498 411008J AI168511 AI022712 AA700366 R07371 R07324 

104340 46289J0 AA426189 F15201 

103747 117944J AA081995 AA101099 

134496 46501J M64936 AI025512 AI382987 BE061777 AA089966 BE169930 T41176 AW594624 BE502415 AA121893 AI269283 T40311 
AI684569AA257011 

AI079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514 
103750 118365J AA126129 AA126033 AA082561 

105239 34624J AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 M668403 AA601627 
AW869639 BE061833 BE000620 AW961 170 AW847519 M308542 AW821833 AW945688 C04699 AA205504 AA377241 
AW821 667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA1 13928 M307904 
C16859 

120379 34624_3 AL042725 BE063316 AW975610 AA457591 BE062092 AI655202 AA714296 AI267264 AI075321 M223286 AA071 122 
AA227849 AA216700 AI696002 AA101867 AA099426 M135997 AL041 698 T02815 T51 824 M207189 T59230 T51868 
M663341 BE165757 AW818104 AW392886 AA584918 AA099408 AW856396 AW861859 M053045 

114624 111686J AA081507 AA070071 AA070840 AA084362 

106851 322947J AI458623 AA639708 AA485409 R22065 M485570 

108392 113549J M075124 AA075208 

100545 22955J1 M55405 AW752552 

100654 tigr_HT2969 A03758A06977A15293D17029D17107D17171 L00132L00133 M12523M13075M13076 M92816U22961 V00494 V00495 

X51363X51364X51365 
100702 tkjr_HT3413 L27065 

102208 6735_9 U22961 AA203623 M503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195 
AI209170 AM 86653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI01 8388 R68584 AA0271 96 AI745413 
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 
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15 



M) 



5=5 

"25 



P0 

m 



35 



40 



45 



50 



N70806 AI141254 AI3831 91 AI401 237 AI080709 AI093400 W84549 T90806 R0001 2 W01413 AA630557 AI378348 AI559265 
AA877103 W84464 AA625146 R68379 AM 33207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404 
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 AM 14729 AI061645 AI064716 AI064959 H77388 
T85706AF075298A!110799D17107NM_000477AF190168 R50724AI248416AI207432AI133684AI133345AI174710 
AI133290 AI133304 AI174948 AI207484 AI1 10717 AF074624 AI1 14515 AF063516 AM 10642 AM 14559 AI114498 AI114759 
AI207568 AI064960 AI174753 AI114666 R69184 R0001 1 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 
H94399 AI133158 T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51189 T74936 AI207490 AH32925 AI064701 
AM74748 AI1 14663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 
AI444620T52290 D16931 T40012 T48403T58926 T69195 AI133061 T50850 AI400677 AI091136 AA334608T57411 Z20979 
N56507T87485AI133622 AA343370 T40075 T69671 T53849T74820AF075316AI110818T40121 T57381 AI114468 
M332728 T51362 AI1 14589 R06691 AM 10629 AF063503 AI140543 AA334661 M332720 M343262 T73513 T86549 
AM 14840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 
AI132926 R09237 AI064838 AM 33660 T60398 T88753 T55930 T92126 AI444602 T60996 AI1 14792 H93911 AM 331 06 
R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064857 AI110730 AF074637 AI207567 
H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842 
AA501730 T39931 T39662 T40136 M334904 T71425 H77784 R00874 AI065049 T84512 T55918 AI207595 T39951 
AA005016 T60361 T69176 T73356 T58795 T61 233 T39955 T60612 AI1 14676 AI064778 AA03571 0 W52763 AI1 14786 
T83564 AA341859 T81684 T55769 AH 14710 T51776 AA34321 3 AM 14714 T58102 AH 10809 R28984 AI174854 AA305675 
AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269 
H54053 T7321 1 AH 14590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 
AI189471 AA005147AI478102AI207662AI192792AI768421 AI064737 AW051713 M936693 AI133117 AI766232 AI913646 
T83962 AI065112 AI207689 AI174684 AI207702 T81475 AI133325 AI032512 M701169 AI936354 AH14720 AI433289 
AA046980 AI823482 AH 14536 M860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 AI823481 
AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AI174774 T67411 T68318 
AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI06481 9 AI205880 T62895 T69430 T951 1 1 M025050 T73330 
W52657 T71984 T691 18 W92684 AI114860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 T57232 AI336158 
T61821 T69457 T62900 T6291 2 T72917 T46885 AI702448 T5721 2 T57203 R94581 T71 31 1 T61819 T89358 T67708 T70918 
T59166 AI187111 T64308 T62071 T69427 AH 14750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394 
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890 
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D1 1809 D12412 T64300 
T28321 T55864 



123941 


genbank_AA621529 


AA621529 


118049 


genbank_N53145 


N53145 


102800 


14782_20 AA313538 U88895 U88902 


104106 


M422123_i_atAA422123_i 


111738 


genbank_R26065 


R26065 


113149 


genbank_T51588 


T51588 


113958 


genbank_W86195 


W86195 


108335 


genbank_M070500 


AA070500 


108351 


genbank_M071193 


AA071193 


108441 


genbank_M079079 


AA079079 


124276 


genbank_H83465 


H83465 


101447 


entrez_M21305 


M21305 


117226 


genbank_N20468 


N20468 


133379 


genbank_M207059 


AA207059,AA207241 


119366 


genbank_T77892 


T77892 


119528 


NOT_FOUND_entrez_W38051 W38051 


112588 


genbank_R77302 


R77302 


114449 


genbank_AA020736 


AA020736 


114576 


genbank_M065096 


AA065096 


107459 


W38002_s_at W38002_s 


130339 


genbank_M435746 


AA435746 
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TABLE 2: Figure 2 from BRCA 001 US 



Table 2 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unfcjene number 

Unkjene Title: Unfcjene gene title 

R1: Ratio of normal breast tissue to tumor 



Pkey 


ExAccn 


UnigenelD 


Unigene Tittle 


R1 


100499 


T51986 


Hs.283108 


hemoglobin, gamma G 


10 


100549 


BE142019 


Hs.222056 


Homo sapiens cDNA FLJ11572 fis, clone HE 


10 


100654 


A03758 




NM_00Q477*:Homo sapiens albumin (ALB), m 


10 


100971 


BE379727 


Hs.83213 


fatty acid binding protein 4, adipocyte 


10 


101184 


NM_001674 


Hs.460 


activating transcription factor 3 


10 


101336 


NM_006732 


Hs.75678 


FBJ murine osteosarcoma viral oncogene h 


10 


101367 


X03350 


Hs.4 


alcohol dehydrogenase 1B (class I), beta 


10 


101447 


M21305 




gb:Human alpha satellite and satellite 3 


10 


101461 


N98569 


Hs.76422 


phospholipase A2, group IIA (platelets, 


10 


101511 


M27826 


Hs.267319 


endogenous retroviral protease 


10 


101736 


M74447 


Hs.502 


transporter 2, ATP-binding cassette, sub-famil 


10 


102208 


U22961 




gb:Human mRNA clone with similarity to L 


10 


102450 


U48251 


Hs.75871 


protein kinase C binding protein 1 


10 


102800 


AA313538 




gb:EST185419 Colon carcinoma (HCC) cell 


10 


102857 


NM 006744 


Hs 76461 


rptinnl-hindinn nrotpin 4 interstitial 


10 


102990 


AA829286 


Hs.332053 


serum amyloid A1 


10 


103747 


AA081995 




gb:zn26d06.r1 Stratagene neuroepithelium 


10 


103812 


AA137107 ' 


Hs.326391 


Homo sapiens, clone MGC:16638, mRNA, com 


10 


104093 


R50727 


Hs.336970 


ESTs 


10 


104109 


AL353957 


Hs.284181 


hypothetical protein DKFZp434P0531 


10 


104250 


F06638 


Hs.12440 


Homo sapiens clone 24734 mRNA sequence 


10 


104492 


N73185 


Hs.94285 


EST 


10 


104506 


N91071 


Hs.109650 


ESTs 


10 


104532 


AI498763 


Hs.203013 


hypothetical protein FU 12748 


10 


104677 


AA009764 


Hs.190380 


ESTs 


10 


104711 


AA017245 


Hs.32794 


ESTs 


10 


104731 


AA019300 


Hs.125070 


ESTs, Moderately similar to I54374 gene 


10 


105005 


AI298208 


Hs.28805 


ESTs 


10 


105036 


AA1 30390 


Hs.25549 


hypothetical protein FLJ20898 


10 


105239 


M221036 




gb:zr03f12.r1 Stratagene NT2 neuronal pr 


10 


106052 


N79885 


Hs.6382 


ESTs, Highly similar to T00391 hypotheti 


10 


106181 


AI803651 


Hs.191608 


ESTs 


10 


106283 


AI085846 


Hs.25522 


KIAA1808 protein 


10 


106379 


AL042069 


Hs.1 19021 


DKFZP434N061 protein 


10 


106451 


AW235928 


Hs.313182 


ESTs 


10 


106491 


AA1 35688 


Hs.10083 


Homo sapiens, clone IMAGE:4139786, mRNA, 


10 


106782 


AW054886 


Hs.25682 


Homo sapiens mRNA for KIAA1863 protein, 


10 


107124 


AB006532 


Hs.31442 


RecQ protein-like 4 


10 


107148 


AI005036 


Hs.334305 


GS1999full 


10 


107214 


AF1 27026 


Hs.5394 


myosin IA 


10 


107242 


AB020672 


Hs.175411 


KIAA0865 protein 


10 


107331 


AI905985 


Hs.111805 


ESTs 


10 


107447 


W28516 


Hs.19210 


hypothetical protein MGC11308 


10 


107451 


AL042425 


Hs.283976 


hypthetical protein PR02389 


10 


107872 


BE271708 


Hs.95110 


ESTs, Weakly similar to A55943 1-phospha 


10 


108351 


AA071193 




gb:zf79b1 2.s1 Soares_pineal_gland_N3HPG 


10 


109546 


F01449 


Hs.26954 


Homo sapiens mRNA; cDNA DKFZp762G123 (fr 


10 


110433 


AW294162 


Hs.301062 


UDP-N-acetyl-alpha-D-galactosamine:polyp 


10 


110976 


AL044174 


Hs.159526 


patched (Drosophila) homolog 


5 
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Wo 90ft 17ft 
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lZy04o 
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Ue OQQQfiQ 

nS.ZooyUo 


1 ft 


19Q"*ft1 


AVVZ400V0 


ns. 1 iuyuo 


l/yono 


ArUzUUoo 


nS.li zZo 




lZyoo4 


bfcZZZUYo 


ns.nouoy 




loUUoo 


M0Z4UZ 


nS.Z/4olo 




1 oUZ4o 


Uoo4oo 


Uc 1*;Q997 
MS. lOOZZ/ 


1 j 


10U4UU 




Ue OQ^tHQ 

HS.zoolUo 


ToU4oo 


Nfvi_uu iyzo 


Ue 4CCKQ7 


!Z : 


JOUOOO 


DhzYU4/Z 


ns.z/yyuu 


iL.J 


A 1AC0A 

130589 


AL1 10226 


Hs.16441 




1 30ooo 


AA993269 


U«> -I7070 

HS. 17872 




■1 Oftff OA 

130689 


NM_006691 


Hs.17917 


Tan 


130689 


AA046747 


Hs.17917 


■1 OIY7H 0 

130718 


N7019b 


HS. 18376 




130798 


M81349 


Hs.1955 


|v 


130840 


OCA A O OA-I 

BE048821 


Hs.20144 




-1 O-l -1 OA 

131184 


a do a nmc 
AbU4l)yo& 


Hs.23954 


J25 


131282 


X03350 


Hs.4 


"■CSS! 


131328 


AW939251 


Lin 1CCA7 


2f 


131543 


AW9ooool 


LI ~ A Ar*Df\ 

Hs.41639 




131753 


a a oonooe 

AA829286 


II. OOAACO 

Hs.332053 




131785 


H69342 


Hs.26320 


|60 


131828 


AJ000263 


Hs.278658 




132426 


AW1 18072 


Hs.89981 


eS; 


132675 


a lorn a nc 

AI291496 


Hs.5476 


; 


1 ozo9o 


Wzo54o 


rls.224829 


£ — : 




INM_UU4ZOO 


nS./B34 




A 004 OA 

1 331 20 


NM_00327o 


Hs.65424 


H OOil AT 

1 3340/ 


ArU179o7 


Hs.7306 




loony 


riZbyu4 


nS./o/3b 




134007 


AF072441 


Hs.7840 


40 


134055 


D86062 


Hs.182423 


134111 


AI372588 


Hs.8022 




134117 


AA081846 


Hs.7921 




134177 


BE243319 


Hs.79672 




134369 


AF207664 


Hs.8230 


45 


134496 


M64936 




134510 


NM 002757 


Hs.250870 




134550 


M26315 


Hs.85258 




134758 


NM 000078 


Hs.89538 




134963 


NM_003394 


Hs.91985 


50 


135066 


X04430 


Hs.93913 


408790 


AW580227 


Hs.47860 




446674 


AA563892 


Hs.306000 



gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens 1 0 

ESTs 10 

ESTs, Moderately similar to ALU5_HUMAN A 1 0 

endogenous retroviral protease 1 0 

perilipin 10 

chromosome 1 1 open reading frame 23 5 

STAT induced STAT inhibitor 3 1 0 

WAS protein family, member 2 1 0 

claudin 5 {transmembrane protein deleted 1 0 

isocitrate dehydrogenase 1 (NADP+), solu 10 

ESTs 10 

insulin-like growth factor binding prate 1 0 

cyclin G associated kinase 1 0 

hemoglobin, gamma G 1 0 

D component of complement (adipsin) 10 

HSPC015 protein 10 

DKFZP434H204 protein 10 

Homo sapiens, clone IMAGE:387501 2, mRNA 1 0 

extracellular link domain-containing 1 10 

extracellular link domain-containing 1 1 0 

KIAA1319 protein 10 

serum amyloid A4, constitutive 1 0 

small inducible cytokine subfamily A (Cy 10 

cerebral cell adhesion molecule 1 0 

alcohol dehydrogenase 1 B (class I), beta 1 0 

v-fos FBJ murine osteosarcoma viral onco 1 0 

programmed cell death 2 1 0 

serum amyloid A1 10 

TRABID protein 10 

keratin, hair, basic, 6 (monilethrix) 10 

diacylglycerol kinase, zeta (1 04kD) 1 0 

Homo sapiens, clone IMAGE:3530123, mRNA, 10 

ESTs 10 

Kruppel-like factor 4 (gut) 10 

tetranectin (plasminogen-binding protein 10 

secreted frizzled-related protein 1 10 

apolipoprotein D 10 

calcineurin binding protein 1 10 

ES1 (zebrafish) protein, human homolog o 10 

TU3A protein 5 
Homo sapiens mRNA; cDNA DKFZp566E183 (fr 5 

Kl AA0652 gene product 10 

a disintegrin-like and metalloprotease ( 10 

gb:Homo sapiens retinoic acid-inducible 10 

mitogen-activated protein kinase kinase 1 0 

CD8 antigen, alpha polypeptide (p32) 5 

cholesteryl ester transfer protein, plas 5 

wingless-type MMTV integration site fami 10 

interleukin 6 (interferon, beta 2) 1 0 

neurotrophic tyrosine kinase, receptor, type 2 10 
solute carrier family 4 (anion exchanger), memb 10 
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TABLE 2A 



Table 2A shows the accession numbers for those pkeys lacking unigenelD's for Table 2. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

1 11 168 38585J AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 
AA157719 AA100472 AA100774 M130756 M1 57705 AA1 57730 AA15771 5 AA053524 AW849581 AW854566 C05254 
AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 H66215 
AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 

103747 117944 1 AA081995 AA101099 

134496 46501J M64936 AI025512 AI382987 BE061777 AA089966 BE169930 T41176 AW594624 BE502415 AA121893 AI269283 T40311 
AI684569 AA25701 1 AI079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514 

105239 34624J AA221036 R87170 BE537068 BE544757 C1 8935 AW812058 T92565 AA22741 5 AA233942 AA223237 AA668403 AA601627 
AW869639 BE061833 BE00062O AW961 170 AW847519 AA308542 AW821 833 AW945688 C04699 AA205504 AA377241 
AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904 
C16859 

100654 tigr_HT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495 
X51363 X51364 X51365 

102208 6735_9 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195 
AI209170 AI186653 AI127795 AI183846 H77389 AI589465 M629390 H94306 AI018388 R68584 M027196 AI745413 
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 M327614 AI042512 AI383957 M156873 AI333101 
N70806 AI141254 AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 
AA877103 W84464 M625146 R68379 AI133207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404 
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 AI1 14729 AI061645 AI064716 AI064959 H77388 
T85706 AF075298 AI1 10799 D17107 NM_000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710 
AI133290 AI133304 AI174948 AI207484 AI1 10717 AF074624 AI114515 AF063516 AI110642 AI114559 AI114498 AI114759 
AI207568 AI064960 AI174753 AI1 14666 R69184 R0001 1 AI064997 T60501 AI207701 T71 735 AA38531 8 H73569 T60496 
H94399 AI1331 58 T74675 AA484750 T7341 3 T55909 R50261 T72061 N80533 T51 189 T74936 AI207490 AI1 32925 AI064701 
AI174748 AI114663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 M343347 T69091 AA233989 T39772 
AI444620 T52290 D16931 T40012 T48403 T58926 T69195 AI133061 T50850 AI400677 AI091 136 AA334608 T5741 1 Z20979 
N56507 T87485AI133622AA343370 T40075T69671 T53849 T74820AF075316 AI110818 T40121 T57381 AI114468 
AA332728 T51362 AI1 14589 R06691 AI1 10629 AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549 
AI114840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 
AI132926 R09237 AI064838 AI133660 T60398 T88753 T55930 T921 26 AI444602 T60996 AH 14792 H9391 1 AI133106 
R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064857 AM 10730 AF074637 AI207567 
H71080 T73217 M343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842 
M501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 AI065049 T84512 T55918 AI207595 T39951 
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AI114676 AI064778 AA035710 W52763 AH 14786 
T83564 AA341859 T81 684 T55769 AH 14710 T51776 AA343213 AH 14714 T58102 AI110809 R28984 AI174854 AA305675 
M343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269 
H54053 T7321 1 AH 14590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 
AI189471 M005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 M936693 AI133117 AI766232 AI913646 
T83962 AI0651 12 AI207689 AI174684 AI207702 T81475 AH 33325 AI032512 AA701 169 AI936354 AH 14720 AI433289 
AA046980 AI823482 AI1 14536 M860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 AI823481 
AA845518 AA719124 AA883454 T68850 T691 15 AI935509 AI150977 T62890 T71374 T68294 AI174774 T6741 1 T68318 
AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI06481 9 AI205880 T62895 T69430 T951 1 1 AA025050 T73330 
W52657T71984 T69118 W92684 AH14860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 T57232 AI336158 
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T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 T67708 T70918 
T59166 AI1871 1 1 T64308 T62071 T69427 AM 14750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394 
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890 
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D1 1809 D12412 T64300 
5 T28321 T55864 

102800 14782.20 M313538 U88895 U88902 

108351 genbank_AA071193 AA071193 

101447 entrez_M21305 M21305 
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TABLE 3: Figure 3 from BRCA 001 US 
Table 3 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnkjenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of normal breast tissue to tumor 



Pkey ExAccn UnigenelD UnigeneTitle R1 

101336 NM_006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 10.0 

102208 U22961 gb:Human mRNA clone with similarity to L 10.0 

102990 AA829286 Hs.332053 serum amyloid A1 10.0 

111168 AI798376 gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens 10.0 

111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5JHUMAN A 10.0 

130085 M62402 Hs.274313 insulin-like growth factor binding prote 10.0 

130840 BE048821 Hs.20144 small inducible cytokine subfamily A (Cy 10.0 

131543 AW966881 Hs.41639 programmed cell death 2 10.0 

133120 NMJ03278 Hs.65424 tetranectin (plasminogen-binding protein 10.0 

134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, plas 10.0 



112 



Attorney Docket No. :0 18501 -0052 10US 



TABLE 3A 

Table 3A shows the accession numbers for those pkeys lacking unigenelD's for Table 3. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

1 1 1 168 38585J AI798376 S46400 AW811617 AW81 1616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 M218567 

AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 M223384 
AA227092 AA227080 T12379 AA092174 T61139 M149776 M699829 AW879188 AW813567 AW813538 AI267168 AA157718 
AA157719 M100472 M100774 AA130756 AA157705 AA157730 M157715 AA053524 AW849581 AW854566 C05254 
AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 H66215 
AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 

1022086735_9 U22961 AA203623 AA503337 Af174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195 

AI209170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413 AI685092 
AI093426 AI623873 AI074570 N50096 M047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 N70806 AI141254 
AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 M877103 W84464 
AA625146 R68379 AI133207 AI132980 AI133214 AI064826 Al 06 1 6 1 5 Al 1 33473 Al 1 74852 Al 1 33404 AI133272 V00494 M12523 
M12523 AI207526 AI133120 AI064802 AI174993 AM 14729 AI061645 AI064716 AI064959 H77388 T85706 AF075298 AI110799 
D17107 NM_000477 AF190168 R50724 AI248416 A1207432 AI133684 AI133345 AI174710 AI133290 AI133304 AI174948 
AI207484 AI1 10717 AF074624 AI1 14515 AF063516 AI110642 AI1 14559 AI1 14498 AI114759 AI207568 AI064960 AI174753 
AI114666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 H94399 AI133158 T74675 AA484750 
T73413 T55909 R50261 T72061 N80533 T51 189 T74936 AI207490 AI132925 AI064701 AI174748 Al 1 14663 AI133104 AI132999 
AI133100 AI064925 AI064979 Al 133063 AA343347 T69091 M233989 T39772 AI444620 T52290 D16931 T40012T48403 
T58926 T69195 AM 33061 T50850 AI400677 AI091136 AA334608 T5741 1 Z20979 N56507 T87485 AI1 33622 M343370 T40075 
T69671 T53849 T74820 AF075316 AM 10818T40121 T57381 AI114468 M332728 T51362 AI114589 R06691 AI110629 
AF063503 AI140543 AA334661 M332720 AA343262 T73513 T86549 AM 14840 T57284 T39981 T61407 T72757 T51749 
T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 AI132926 R09237 AI064838 AI133660 T60398 T88753 
T55930 T92126 AI444602 T60996 AM 14792 H93911 AI133106 R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 
AI133040 AI133017 AI064857 AM 10730 AF074637 AI207567 H71080 T73217 M343950 AI174743 AA334224 AA334281 
R06692 T64739 T40163 T60628 T81661 T73179 R01842 AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 
AI065049 T84512 T55918 AI207595 T39951 M005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AM 14676 
AI064778 AA035710 W52763 AI114786 T83564 M341859 T81684 T55769 AI114710 T51776 M343213 AI114714 T58102 
AI110809 R28984 AI174854 AA305675 AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 
AF075308 W86731 T82851 T48269 H54053 T7321 1 AM 14590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 
R02576 T95666 AI203974 AI1 89471 AA005147 AI4781 02 AI207662 AM 92792 AI768421 AI064737 AW051713AA936693 
AI1331 17 AI766232 AI913646 T83962 AI0651 12 AI207689 AM 74684 AI207702 T81475 AI1 33325 AI032512 AA701 169 AI936354 
AI114720 AI433289 AA046980 AI823482 AM 14536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 
T78943 AI823481 M845518 AA719124 AA883454 T68850 T691 15 AI935509 AI150977 T62890 T71374 T68294 AI174774 
T67411 T68318 AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T951 1 1 
M025050 T73330 W52657 T71984 T691 18 W92684 AI1 14860 T62093 T61797 AI522333 T73322 H92981 T5601 8 T6181 1 
T57232 AI336158T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 
T67708 T70918 T59166 AI1871 1 1 T64308 T62071 T69427 AM 14750 T60430 R09734 T69033 T69141 T69453 T67908 R1 6809 
T69394 AI207729 T55839 T90273 T73339 AW1 94909 T75486 T71850 T71 305 T71287 T53877 T73452 T68852 N75290 
AI312890 T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D1 1809 D12412 
T64300 T28321 T55864 
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TABLE 4: Figure 4 from BRCA 001 US 

Table 4 shows genes upregulated in tumor tissue compared to normal breast tissue. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal breast tissue 

Pkey ExAccn UnigenelD Unigene Title R1 

100113 NM_001269Hs.84746 chromosome condensation 1 2.3 

100114 X02308 Hs.82962 thymidylate synthetase 2.9 
100131 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodiesterase 1 1.9 

100146 BE185499 Hs.2471 KIAA0020 gene product 1.9 

100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin l-like) (periostin) 7.5 
100154 H60720 Hs.81892 KIAA01 01 gene product 9.2 
100163 W44671 Hs.124 gene predicted from cDNA with a complete coding sequence 1.6 
100220 AW015534 Hs.217493 annexinA2 2.0 
100265 D38521 Hs. 112396 KIAA0077 protein 1.5 
100271 BE160081 Hs.256290 S100 calcium-binding protein A11 (cakjizzarin) 13.5 
100275 BE242802 Hs.154797 KIAA0090 protein 5.1 
100323 D50920 Hs.23106 KIAA0130 gene product 1.9 
100335 AW247529 Hs.6793 platelet-activating factor acetylhydrolase, isoform lb, gamma subunit (29kD) 2.7 
100364 NM_004341Hs.154868 carbamoyl-phosphate synthetase 2, aspartate transcarbamylase, and dihydroorotase 2.0 
100372 NM_014791Hs.184339 KIAA0175 gene product 2.6 
100393 D84145 Hs.39913 novel RGD-containing protein 3.2 
100400 AW954324 Hs.75790 phosphatidylinositol glycan, class C 1.5 
100418 D86978 Hs.84790 KIAA0225 protein 2.0 
100482 M65028 Hs.81361 heterogeneous nuclear ribonucleoprotein A/B 2.9 
100518 NM_004415Hs.74316 desmoplakin (DPI, DPII) 1.9 

100666 L05424 Hs.1 69610 CD44 antigen (homing function and Indian blood group system) 5.7 

100667 L05424 Hs.1 69610 CD44 antigen (homing function and Indian blood group system) 9.0 

100668 L05424 Hs.1 69610 CD44 antigen (homing function and Indian blood group system) 7.6 
100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 
100685 AA328229 Hs.1 84582 ribosomal protein L24 1.8 
100690 AA383256 Hs.1657 estrogen receptor 1 1.6 
100783 AF078847 Hs.1 91356 general transcription factor IIH, polypeptide 2 (44kD subunit) 5.9 
100850 AA836472 Hs.297939 cathepsinB 1.7 
100892 BE245294 Hs.180789 S164 protein 1.7 
100945 AF002225 Hs.180686 ubiquitin protein ligase E3A (human papilloma virus E6-associated protein, Angelman syndrome) 1.5 
100969 AA157634 Hs.79172 solute carrier family 25 (mitochondrial carrier; adenine nucleotide translocator), member 5 6.3 
100988 AK000405 Hs.76480 ubiquitin-like 4 11.4 
100999 H38765 Hs.80706 diaphorase (NADH/NADPH) (cytochrome b-5 reductase) 1.6 
101031 J05070 Hs.1 51738 matrix metalloproteinase 9 (gelatinase B, 92kD gelatinase, 92kD type IV collagenase) 8.2 

1 01 045 J0561 4 gb:Human proliferating cell nuclear antigen (PCNA) gene, promoter reg ion. 5.0 

101077 N99692 Hs.75227 Empirically selected from AFFX single probeset 2.6 

1 01 093 L06419 Hs.75093 procollagen-lysine, 2-oxoglutarate 5-dioxygenase (lysine hydroxylase, Ehlers-Danlos syndrome type Vl)1 .4 

101161 NM_006262Hs.37044 peripherin 16.9 

101186 AA020956 Hs.1 79881 core-binding factor, beta subunit 2.0 

101216 AA284166 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK2-associated dual specificity phosphatase) 1.8 

101228 AA333387 Hs.82916 chaperonin containing TCP1, subunit 6A(zeta1) 1.7 

101247 AA132666 Hs.78802 glycogen synthase kinase 3 beta 1.9 

101249 L18964 Hs.1 904 protein kinase C, iota 1.5 

101332 J04088 Hs.1 56346 topoisomerase (DNA) II alpha (170kD) 5.2 

101332 J04088 Hs.1 56346 topoisomerase (DNA) II alpha (170kD) 3.4 

101352 AI494299 Hs.1 6297 COX17 (yeast) homolog, cytochrome c oxidase assembly protein 6.3 

101396 BE267931 Hs.78996 proliferating cell nuclear antigen 4.2 

101445 M21259 gb:Human Alu repeats in the region 5' to the small nuclear rib 1.9 

101470 NM_000546Hs.1846 tumor protein p53 (Li-Fraumeni syndrome) 1.6 
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101478 NM_002890Hs.758 RAS p21 protein activator (GTPase activating protein) 1 2.5 

101483 M24486 Hs.76768 procollagen-proline, 2-oxoglutarate 4-dioxygenase {proline 4-hydroxylase), alpha polypeptide I 5.5 

101540 J04977 Hs.84981 X-ray repair complementing defective repairin Chinese hamster cells 5 (double-strand-break rejoining 2.1 

101573 AW248421 Hs.250758 proteasome (prosome, macropain) 26S subunit, ATPase, 3 1.6 

5 101580 NM_012151Hs.83363 coagulation factor Vlll-associated (intronic transcript) 5.7 

101592 AF064853 Hs.91299 guanine nucleotide binding protein (G protein), beta polypeptide 2 1.8 

101592 AF064853 Hs.91299 guanine nucleotide binding protein (G pr 5.6 

101621 BE391804 Hs.62661 guanylate binding protein 1 , interferon-inducible, 67kD 2.4 

101702 AW504089 Hs.179574 protein phosphatase 2 (formerly 2A), regulatory subunit B (PR 52), alpha isoform 1.3 

10 101734 M74099 Hs.147049 cut (Drosophila)-like 1 (CCAAT displacement protein) 2.1 

101759 M80244 Hs.184601 solute carrier family 7 (cationic amino acid transporter, y+ system), member 5 5.0 

101767 M81057 Hs.1 80884 carboxypeptidase B1 (tissue) 14.4 

101782 AA306495 Hs.1869 phosphoglucomutase 1 5.2 

10,1805 AW409747 Hs.75612 stress-induced-phosphoprotein 1 (Hsp70/Hsp90-organizing protein) 8.6 

15 |01806 AA586894 Hs.1 12408 S100 calcium-binding protein A7 (psoriasin 1) 8.9 

(101810 NM_000318Hs.180612 peroxisomal membrane protein 3 (35kD, Zellweger syndrome) 3.2 

i-* 101879 AA1 76374 Hs.243886 nuclear autoantigenic sperm protein (histone-binding) 1.6 

101911 AA441787 Hs.1 19689 glycoprotein hormones, alpha polypeptide 31.3 

W 101920 AF182645 Hs.8024 IK cytokine, down-regulator of HLA II 1.8 

£20 101973 U41514 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 1 (GalNAc-T1) 2.4 

= =; 101983 AI904232 Hs.75323 prohibitin 8.4 

*fJ 102009 BE245149 Hs.82643 protein tyrosine kinase 9 1.3 

W 102036 BE250127 Hs.82906 CDC20 (cell division cycle 20, S. cerevisiae, homolog) 2.0 

flj 102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 45kD 1.6 

.,.25 102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 4 1.3 

"2; 102107 BE258602 Hs.182366 heat shock protein 75 1.4 

tj 102123 NM_001809Hs.1594 centromere protein A (17kD) 1.8 

102165 BE313280 Hs.1 59627 death associated protein 3 4.6 

~. 1 02198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regulatory subunit (50kD) 4.3 

-SO 102217 AA829978 Hs.301613 JTV1 gene 6.7 

Id-. 102220 U24389 Hs.65436 lysosomal 4.3 

n\ 102234 AW1 63390 Hs.278554 heterochromatin-like protein 1 1.9 

! -|! 102260 AL039104 Hs.159557 karyopherin alpha 2 (RAG cohort 1, importin alpha 1) 4.4 

4= 102302 AA306342 Hs.69171 protein kinase C-like 2 2.7 

C35 102330 BE298063 Hs.77254 chromobox homolog 1 (Drosophila HP1 beta) 1.5 

k< 102339 BE378432 Hs.95577 cyclin-dependent kinase 4 2.3 

\y 102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member B2 2.0 

102349 AU077055 Hs.289107 baculoviral IAP repeat-containing 2 3.2 

102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 2.0 

40 102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2 

102391 AA296874 Hs.77494 deoxyguanosine kinase 1.5 

102455 U48705 Hs.75562 discoidin domain receptor family, member 1 6.9 

102465 NM_001359Hs.81548 2,4-dienoyl CoA reductase 1, mitochondrial 1.8 

102488 U50939 Hs.61828 amyloid beta precursor protein-binding protein 1, 59kD 1.5 

45 102489 AL080116 Hs.74420 origin recognition complex, subunit 3 (yeast homolog)-like 3.3 

102494 AI188137 Hs.75193 COP9 homolog 2.1 

102501 AF217197 Hs.74562 siah binding protein 1; FBP interacting repressor; pyrimidine tract binding splicing 3.2 

102522 BE250944 Hs.183556 solute carrier family 1 (neutral amino acid transporter), member 5 2.8 

102532 AF040253 Hs.70186 suppressorof Ty (S.cerevisiae) 5 homolog 5.7 

50 102564 U59423 Hs.79067 MAD (mothers against decapentaplegic, Drosophila) homolog 1 2.3 

102568 W81489 Hs.223025 RAB31 , member RAS oncogene family 5.3 

102580 U60808 Hs.152981 CDP-diacylglycerol synthase (phosphatidate cytidylyltransferase) 1 2.1 

102581 AU077228 Hs.77256 enhancer of zeste (Drosophila) homolog 2 1.6 

102582 U61232 Hs.32675 tubulin-specific chaperone e 2.1 
55 102617 AW161453 Hs.198767 COP9 (constitutive photomorphogenic, Arabidopsis, homolog) subunit 5 1.8 

102618 AL037672 Hs.81071 extracellular matrix protein 1 5.8 

102627 AL021918 Hs.158174 zinc finger protein 184 (Kruppel-like) 1.3 

102663 NM_002270Hs.168075 karyopherin (importin) beta 2 1.8 

102676 BE262989 Hs.12045 putative protein 2.3 

60 102687 NM_007019Hs.93002 ubiquitin carrier protein E2-C 4.3 

102689 U96132 Hs.171280 hydroxyacyl-Coenzyme A dehydrogenase, type II 6.0 

102696 BE540274 Hs.239 forkhead box M1 4.2 

102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 1.9 

102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cys-Cys), member 19 2.3 
65 102750 AB014460 Hs.66196 nth (E.coli endonuclease lll)-like 1 1.2 

102801 BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin B6) kinase 6.4 

102812 U90549 Hs.236774 high-mobility group (nonhistone chromosomal) protein 17-like 3 1.6 
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102827 


BE244588 Hs.6456 


chaperonin containing TCP1, subunit 2 (beta) 


5.6 




102831 


AA262170 Hs.80917 


adaptor-related protein complex 3, sigma 1 subunit 


2.0 




102844 


AV653790 Hs.324275 


WW domain-containing protein 1 


1.3 




102868 


X02419 Hs.77274 


plasminogen activator, urokinase 


4.4 


5 


102925 


BE440142 Hs.2943 


signal recognition particle 19kD 


1.9 




102935 


BE561850 Hs.80506 


small nuclear ribonucleoprotein polypeptide A' 


2.4 




102968 


AU076611 Hs.1 54672 


methylene tetrahydrofolate dehydrogenase (NAD+ dependent), methenyltetrahydrofolate cyclohydroiase2.7 




102983 


BE387202 Hs.1 18638 


nnn-mptastatir r^lls 1 nrofpin (NM23A} exDressed in 


3.1 


10 


102985 


U95742 Hs.2707 


<J 1 UJ O [Jl 1 dot? UdllolUUII 1 


5.2 


103023 


AW500470 Hs.1 17950 


multifunctional polypeptide similar to SAICAR synthetase and AIR carboxylase 


1.6 




103038 


AA926960 Hs.334883 


nrntpin kinasp i 


2.5 




103060 


NM_005940Hs.155324 


IIIdUIA lllcLdlKJ|JIULCIlIdoC I I ^IVIIVIr I I, oUUII Iciyolll Of 


4.5 




103080 


AU077231 Hs.82932 


cycitn u i ^rKnu i . pdrduiyruiu duciiuiiidiuoio i / 


3.1 


15 


103089 


D31152 Hs.179729 


conagen, type a, aipna t ^ocnrniu rneiapnyseai cnonurouyopidoidy 


2.4 


103177 


BE244377 Hs.48876 


Tarnesyi-oipiiOopridic idificoyiuaiibicidoc i 


3.5 


103178 


AA205475 Hs.275865 


IIUUbUNIdl piUlclll o IO 


9.9 


— 


103179 


NM_001777Hs.82685 


K/UHi dllUycM \r\il-lcldlcu dltuycll, llllcy [Hi-dooUMdiCU oiyiiai udiiouuuciy 


1.3 




103181 


X69636 Hs.334731 


nomo sapiens, cione nviMot. j*woouo, mrMNrt, pdnidi ujo 


2.0 




103185 


NM_006825Hs.74368 


tranemomhrana nrwtoin anHnnlaomin rotir'ntniYi^rtini intormoHiatfa Pfimnartmpnt 

uansiTiciTiurdnc pruicin ^ooivuy, ciiuuptdoiuiLr iciiuuiuiit/ouiy i n iicii iieuiaic wjnipai ui ici n 


1.6 


loo 


103191 


AA401039 Hs.2903 


proiein pnospnaiase h \TunTieriy A), cdidiyuo ouuuini 


2.5 




103193 


NM 004766HS.75724 


CUdiyillcl piULclfl UUIIipicA, oUUUIlK Ucld £. ^Ucld fJlllllc^ 


2.2 


ir ! ?c : 


103194 


NM_004939Hs.78580 


nPAn/H /Acn CX\\ i Ala Acn/Hic^ Kay nnlunontiHo 1 

L/CMU/n ^rtojj-oiu rtid-nojj/nioy Uva puiyjjcjjuuc i 


6.3 


5 k ! 


103206 


X72755 Hs.77367 


monokine induced by gamma interferon 


8.8 


\i 


103223 


BE275607 Hs.1 708 


chaperonin containing TCP1, subunit 3 (gamma) 






103232 


X75962 Hs.1 29780 


tumor necrosis factor receptor superfamily, member 4 






1(TO38 


AI'3f>Q28 c > HW5189 

ni jujluj i io./ w i uo 


death -associated protein 


0.0 


S 


103297 


IMM f)fl1 I i4 , iHs Q078 


immature colon carcinoma transcript 1 


1 Q 




103130 


AIR03447 H<! 7749.fi 


smai! nuclear ribonucleoprotein polypeptide G 


£..0 




103349 


X89059 


gb:H. sapiens mRNA for unknown protein expressed in macrophage 


1 R 
I.0 




1 VOO/ u 




coated vesicle membrane protein 


1 ft 


II! 




X'QAA'tt Hs114^fifi 


pyrroline-5-cartx>xylate synthetase (glutamate gamma-semialdehyde synthetase) 










gb:H. sapiens dbi/acbp gene exon 1 & 2. 


a n 




1(1^4^0 

lUOnOv 




translocase of inner mitochondrial membrane 17 (yeast) homolog A 


1.0 




103491 




myeloid/lymphoid or mixed-lineage leukemia 3 


<i R 


f!35 




AL031224 H<; 33102 


transcription factor AP-2 bete (activating enhancer-binding protein 2 beta) 


C\ 1 
0. 1 




103547 


AI37S722 180062 


proteasome (prosome, macropain) subunit, beta type, 8 (large multifunctional protease 7) 


Q 7 

3.1 






nm nnR2i8H'? 85701 


pnospnoinosiiiuc-j-Kinase, cataiyiic, aipiid poiypcpuuc 






103613 


iNivi_uuUij*turio.fcO iu 


SRY (sex determining region Y)-box 9 (campomelic dysplasia, autosomal sex-reversal) 


1 ? 
I.O 


40 


103621 


BE379766 Hs. 150675 


polymerase (RNA) II (DNA directed) polypeptide K (7.0kD) 




103622 


AAfiOQfiS"; H<! 278672 


membrane component, chromosome 11, surface marker 1 


C..O 




103727 


AI878883 Hs.296381 


growth factor receptor-bound protein 2 


1.3 






Al 1W01 Hi 8768 


hypothetical protein rLJlUo4y 


I.O 




103754 


AI01"5709 H<! 172089 


Homo sapiens mRNA; cuna DKrZpbobizU^ (trom clone UM-zpoobizU/^) 


1.0 


45 


103780 


AA094752 Hs.169992 


hypothetical 43.2 Kd protein 


7 R 


103795 


H26531 Hs 7367 


Homo sapiens BTB domain protein (BDPL) mRNA, partial cds 


I.Z 




103797 


AA080912 


yu.zjiu*HiV70.r i OLidLdyciic i iicuium \poi zoo) nuiiiu od|jieiio uui^rt uiunc *j oiiihkii 


1.5 




103813 


AI042"582 Hs 181271 


CGI-120 protein 


15 




103855 


W02363 Hs 302267 


hypothetical protein FLJ10330 


1.5 


50 




AKnm?7ft Hs ms7^7 

nlMJU O MO. vJ( 


hypothetical protein FLJ10416 similar to constitutive photomorphogenic protein 1 


6.5 




MM nfl94fi7Hs 

IN IVI__UU^*TV / MO.3/0 t TH 


mammaglobin 2 


2.9 




l\M\Jl o 


nnt3 1 £*r£ no. 1 \JQ£.OQ 


ESTs 


1.4 




lUH I J *t 


AA47AQR4 Hq fid^1 


PRO0659 protein 


5.6 




104227 


AB002343 Hs.98938 


protocadherin alpha 9 


1.6 


55 


104275 


AI751970 Hs.101067 


GCN5 (general control of amino-acid synthesis, yeast homolog)-like 2 


5.4 


104325 


BE379766 Hs.150675 


polymerase (RNA) II (DNA directed) polypeptide K (7.0kD) 


6.3 




104370 


AA324597 Hs.21851 


Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321 


1.6 




104423 


R83113 Hs.1432 


protein kinase C substrate 80K-H 


5.2 




104482 


AB037762 Hs.44268 


myelin gene expression factor 2 


1.2 


60 


104532 


AI498763 Hs.203013 


hypothetical protein FU12748 


2.1 


104563 


AL1 17403 Hs.306189 


DKF2P434F1735 protein 


1.2 




104667 


AI239923 Hs.30098 


ESTs 


1.3 




104757 


AI694413 Hs.332649 


olfactory receptor, family 2, subfamily I, member 6 


2.3 




104804 


AI858702 Hs.31803 


ESTs, Weakly similar to N-WASP [H.sapiens] 


1.3 


65 


104806 


AB023175 Hs.22982 


KIAA0958 protein 


2.3 


104827 


AW052006 Hs.8551 


PRP4/STK/WD splicing factor 


10.9 




104846 


AI250789 Hs.32478 


ESTs 


5.6 




104854 


AA041276 Hs.154729 


3-phosphoinositide dependent protein kinase-1 


12.3 
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104867 AA278898 Hs.225979 hypothetical protein similar to small G proteins, especially RAP-2A 2.0 

104871 T78044 Hs.28893 Homo sapiens mRNA; cDNA DKFZp56402364 (from clone DKFZp56402364) 1.3 

104896 AW015318 Hs.23165 ESTs 17.7 

104909 AW408164 Hs.249184 transcription factor 19 (SC1) 5.0 

5 104916 AW958157 Hs.1 55489 NS1 -associated protein 1 1.7 

104919 AA026880 Hs.25252 prolactin receptor 1.4 

104930 AF043467 Hs.32893 neurexophilin 2 2.2 

104973 NM_015310Hs.6763 KIM0942 protein 5.0 

104974 Y12059 Hs.278675 bromodomain-containing 4 1.4 
10 104975 AL136877 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeastj-like 1 2.4 

104975 AL136877 Hs.50758 SMC4 (structural maintenance of chromoso 2.3 

104978 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010317E24 gene, clone IMAGE:3502019, mRNA, partial cds 7.2 

104979 AA937934 Hs.321062 ESTs 1.3 
104994 AI499930 Hs.334885 mitochondrial GTP binding protein 3.5 

15 105009 BE379584 Hs.34789 dolichyl-diphosphooligosaccharide-protein glycosyltransferase 5.5 

105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 3.3 

105028 AI050715 Hs.2331 E2F transcription factor 5, p130-binding 2.2 
M= 105032 AA127818 gb:zl12a02.s1 Soares_pregnant_uterus_NbHPU Homo sapiens cDNA clone IMAGE:501674 3' 6,8 

f=i 105039 AA907305 Hs.36475 ESTs 2.5 

20 105041 AB037716 Hs.26204 KIAA1295 protein 2.2 

*= ; 105045 BE242899 Hs. 129951 speckle-type POZ protein 3.8 

yl 105079 AA151342 Hs.12677 CGI-1 47 protein 9.5 

ft- 105087 AA147884 Hs.9812 Homo sapiens CDNAFU1 4388 fis, clone HEMBA1002716 5.6 

!?! 105088 H58589 Hs.35156 Homo sapiens CDNAFU1 1027 fis, clone PLACE10041 14 2.2 

tU5 105095 Z78407 Hs.27023 vesicle transport-related protein 2.2 

105110 BE387350 Hs.33122 KIAA1 160 protein 1.6 

f =; 105126 AW975433 Hs.36288 ESTs 6.3 

*=■' 105127 AA045648 Hs.301957 nudix (nucleoside diphosphate linked moiety X)-type motif 5 2.1 

*" 105141 AA164687 Hs.177576 mannosyl (alpha-1,3-)-glycoprotein beta-1,4-N-acetylglucosaminyltransferase, isoenzyme A 2.7 

f30 105158 AW976357 Hs.234545 hypothetical protein NUF2R 1.9 

r: 105169 BE245294 Hs.180789 S164 protein 1.7 

105186 AA191512 Hs.28005 Homo sapiens cDNA FLJ1 1309 fis, clone PLACE1010076 4.8 

fl-i 105254 AA071276 Hs.19469 KIAA0859 protein 1.9 

> 105281 AA263143 Hs.24596 RAD51-interacting protein 2.8 

2|5 105288 N99673 Hs.3585 ESTs, Weakly similar to AF1 26743 1 DNAJ domain-containing protein MCJ [H.sapiens] 1.9 

P 105302 M700122 Hs.3355 sentrin-specific protease 8.0 

ft! 105331 AW270037 Hs.1 79507 KIAA0779 protein 1.8 

105359 NM_016015Hs.8054 CGI-68 protein 8.2 

105366 BE264645 Hs.282093 hypothetical protein FU21 91 8 5.0 

40 105373 AW887701 Hs.32356 hypothetical protein FLJ20628 2.5 

105374 BE242803 Hs.262823 hypothetical protein FU1 0326 2.2 

105387 AW592146 Hs.108636 membrane protein CH1 2.3 

105393 AF167570 Hs.256583 interleukin enhancer binding factor 3, 90kD 5.4 

105399 BE386877 Hs.334811 Npw38-binding protein NpwBP 1.6 

45 105400 AF198620 Hs.65648 RNA binding motif protein 8A 1.6 

105445 AA252395 gb:zs12g10.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:685026 3', mRNA sequence. 5.0 

105507 BE268348 Hs.226318 CCR4-NOT transcription complex, subunit 7 1.6 

105529 AA1 13449 Hs.32471 hypothetical protein FLJ20364 1.3 

105530 AB023179 Hs.9059 KIAA0962 protein 3.4 
50 105547 AA262640 Hs.27445 unknown 9.3 

105564 BE616694 Hs.288042 hypothetical protein FLJ14299 1.4 

105596 M579535 Hs.18490 hypothetical protein FLJ20452 10.9 

105597 AF054284 Hs.334826 splicing factor 3b, subunit 1, 155kD 2.9 
105608 AI808201 Hs.287863 hypothetical protein FU12475 1.7 

55 105610 AA280072 Hs.99872 fetal Alzheimer antigen 1.4 

105617 AK000892 Hs.4069 glucocorticoid modulatory element binding protein 1 1.7 

105620 AW302245 Hs.181390 casein kinase 1, gamma 2 5.5 

105658 AA985190 Hs.246875 hypothetical protein FLJ20059 9.4 

105697 AW499988 Hs.27801 zinc finger protein 278 2.0 

60 105708 R26944 Hs.180777 Homo sapiens mRNA; cDNA DKFZp564M0264 (from clone DKFZp564M0264) 1.7 

105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short 2.6 

105746 AW151952 Hs.46679 hypothetical protein FLJ20739 1.5 

105759 AI123118 Hs.15159 chemokine-like factor, alternatively spliced 1.3 

105771 AI267720 Hs.153221 synovial sarcoma, translocated to X chromosome 1.6 

65 105820 AA741336 Hs.152108 transcriptional unit N143 2.2 

105826 AA478756 Hs.194477 E3 ubiquitin ligase SMURF2 1.3 

105856 AI262106 Hs.12653 ESTs 2.4 
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105858 AF151066 Hs.281428 hypothetical protein 2.9 

105875 AK001708 Hs.32271 hypothetical protein FU 10846 1.4 

105930 AF016371 Hs.9880 peptidyl prolyl isomerase H (cyclophilin H) 5.2 

106000 AW194426 Hs.20726 ESTs 1.7 

5 106011 AW081 202 Hs.1 2284 Homo sapiens, clone IMAGE:2989556, mRNA, partial cds 2.8 

106017 M477956 Hs.26268 ESTs 1.4 

106073 AL1 57441 Hs.17834 downstream neighbor of SON 1.4 

106078 AA130158 Hs.19977 ESTs, Moderately similar to ALU8JHUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.6 

106094 AA533491 Hs.23317 hypothetical protein FLJ14681 6.8 

10 106140 AB006624 Hs.14912 KIAA0286 protein 1.6 

106271 AA251393 Hs.289052 Homo sapiens, Similar to RIKEN cDNA 5430429M05 gene, clone MGC:13155, mRNA, complete cds 10.8 

106288 AB037742 Hs.24336 KIAA1321 protein 1.3 

106300 Y10043 Hs.19114 high-mobility group (nonhistone chromosomal) protein 4 3.6 

106333 AL043114 Hs.22410 ESTs,WeaklysimilartoA54849collagenalpha1(VII)chainprecursor[H.sapiens] 5.4 

15 106350 AK001404 Hs.194698 cyciin B2 5.7 

106359 AW390282 Hs.31130 transmembrane 7 superfamily member 2 6.3 

H : 106381 AB040916 Hs.24106 KIAA1483 protein 6.5 

106389 AW748420 Hs.6236 Homo sapiens cDNA:FLJ21487fis, clone COL05419 2.2 

: 106457 AF119256 Hs.27801 zinc finger protein 278 2.7 

M20 106470 D63078 Hs.186180 Homo sapiens cDNA: FLJ23038 fis, clone LNG02039 2.3 

LP! 106531 AA454036 Hs.8832 ESTs 1.6 

w 106586 AA243837 Hs.57787 ESTs 1.6 

106589 AK000933 Hs.28661 Homo sapiens CDNAFLJ10071 fis, clone HEMBA1001702 2.4 

W 106610 AA458882 Hs.79732 fibulinl 7.9 

<25 106624 NM_003595Hs.26350 tyrosylprotein sulfotransferase 2 7.7 

i? =: 106650 AL049951 Hs.22370 Homo sapiens mRNA; cDNADKFZp564O0122 (from clone DKFZp564O01 22) 1.8 

W 106669 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] 1.3 

¥ 106713 BE614802 Hs.184352 hypothetical protein FLJ 12549 4.5 

r-i 106717 AA600357 Hs.239489 TIA1 cytotoxic granule-associated RNA-binding protein 1.3 

r'BO 106723 BE388094 Hs.21857 ESTs 1.6 

106795 AF1 74487 Hs.293753 Bcl-2-related ovarian killer protein-like 5.7 

?1J 106829 AW959893 Hs.27099 hypothetical protein FLJ23293 similar to ARL-6 interacting protein-2 16.2 

106831 BE564871 Hs.29463 centrin, EF-hand protein, 3 (CDC31 yeast homolog) 1.5 

T 106846 AB037744 Hs.34892 KIAA1 323 protein 2.2 

1=35 106852 AF151031 Hs.300631 hypothetical protein 1.3 

Hi 106873 N49809 Hs.11197 Homo sapiens, clone IMAGE:3343149, mRNA, partial cds 16.8 

106886 W79171 Hs.9567 GL002 protein 1.5 

106906 AA861271 Hs.222024 transcription factor BMAL2 2.2 

106920 AK001838 Hs.296323 serum/glucocorticoid regulated kinase 3.3 

40 106945 AK000511 Hs.6294 hypothetical protein DKFZp434L1435 similar to valyltRNA synthetase 6.8 

106973 BE156256 Hs.11923 hypothetical protein 6.6 

106977 AL043152 Hs.50421 KIAA0203 gene product 4.8 

106978 AW631480 Hs.8688 ESTs 6.0 
107004 AA1 46872 Hs.300700 hypothetical protein FLJ20727 1.3 

45 107029 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 1.8 

107071 AW385224 Hs.35198 ectonucleotide pyrophosphatase/phosphodiesterase 5 (putative function) 1.7 

107113 AK000733 Hs.23900 GTPase activating protein 2.5 

107125 AK000512 Hs.69388 hypothetical protein FLJ20505 1.7 

107136 AV661958 Hs.8207 GK001 protein 4.6 

50 107136 AV661958 Hs.8207 GK001 protein 3.3 

107146 AK001455 Hs.5198 Down syndrome critical region gene 2 2.0 

107151 AW378065 Hs.8687 ESTs 6.3 

107155 AW391927 Hs.7946 KIAA1288 protein 33.5 

107174 BE122762 Hs.25338 ESTs 5.2 

55 107197 W15477 Hs.64639 glioma pathogenesis-related protein 6.1 

107221 AW888411 Hs.81915 leukemia-associated phosphoprotein p18 (stathmin) 17.4 

107243 BE219716 Hs.34727 ESTs, Moderately similar to I38759 zinc finger/leucine zipper protein [H.sapiens] 7.4 

107248 AW263124 Hs.315111 nuclear receptor co-repressor/HDAC3 complex subunit 1.8 

107263 D60341 Hs.21198 translocase of outer mitochondrial membrane 70 (yeast) homolog A 6.6 

60 107265 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 2.5 

107298 N95657 Hs.6820 ESTs, Moderately similar to YOJ1_CAEEL HYPOTHETICAL 63.5 KD PROTEIN ZK353.1 IN 2.5 

107298 N95657 Hs.6820 ESTs, Moderately similartoYOJ1_CAEELH 1.7 

107299 BE277457 Hs.30661 hypothetical protein MGC4606 3.2 
107316 T63174 Hs.193700 Homo sapiens mRNA; cDNA DKFZp586l0324 (from clone DKFZp586l0324) 2.0 

65 107354 NM_006299Hs.96448 zinc finger protein 193 5.0 

107392 AW299900 Hs.267632 TATA element modulatory factor 1 1.2 

107481 AA307703 Hs.279766 kinesin family member 4A 1.6 

118 



Attorney Docket No.:018501-005210US 



107529 BE515065 Hs.296585 nucleolar protein (KKE/D repeat) 3.0 

107554 AAO01386 Hs.59844 ESTs 1.3 
107681 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 2.2 

107772 AA018587 Hs.303055 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 2.1 

5 107859 AW732573 Hs.47584 potassium voltage-gated channel, delayed-rectifier, subfamily S, member 3 8.4 

107901 L42612 Hs.335952 keratin 6B 2.5 

107901 L42612 Hs.335952 keratin 6B 1.6 

107922 BE153855 Hs.61460 Ig superfamily receptor LNIR 2.2 

107974 AW956103 Hs.61712 pyruvate dehydrogenase kinase, isoenzyme 1 6.7 
10 108040 AL121031 Hs.159971 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily b, member 1 1.5 

108230 AA054224 Hs.59847 ESTs 1.3 

108274 AF1 29535 Hs.272027 F-box only protein 5 7.1 

108296 N31256 Hs.161623 ESTs 2.5 

108496 AA083069 Hs.339659 ESTs 3.5 

=.15 108607 BE300380 Hs.69476 Homo sapiens cDNAFLJ12758fis, clone NT2RP2001328 3.4 

C 108621 AA101809 Hs.182685 ESTs 1.6 

P 108634 AW022410 Hs.69507 ESTs 17 

r=i 108647 BE546947 Hs.44276 homeoboxCIO 9-8 

= = 108695 AB029000 Hs.70823 KIAA1077 protein 7.2 

«20 108717 AA122393 Hs.70811 hypothetical protein FLJ2051 6 1-3 

CB 108740 AI089575 Hs.9071 progesterone membrane binding protein 2.7 

n\ 108828 AK001693 Hs.273344 DKFZP564O0463 protein 1-8 

l C- 108859 AL121500 Hs.178904 ESTs 1.5 

108872 H06720 Hs.111680 endosulfine alpha 2.1 

©5 108891 AI801235 Hs.48480 ESTs 5.3 

108894 AK001431 Hs.5105 hypothetical protein FLJ10569 4.0 

1. 108955 AA149754 Hs.195155 Homo sapiens amino acid transport system N2 (SN2) mRNA, complete cds 5.6 

U 108982 AA151708 Hs.171980 homeo box (expressed in ES cells) 1 1.6 

U, 108987 AA152178 Hs.23467 hypothetical protein FLJ 10633 6.2 

1=30 109002 AB028987 Hs.72134 KIAA1 064 protein 1.7 

= j : 109011 AA156542 Hs.72127 ESTs 14 
=P 109026 AA157811 gb:zo35d07.s1 Stratagene colon (937204) Homo sapiens cDNA clone 3' similar to contains Alu repetitive 5.3 

M 109068 AA1 64293 Hs.72545 ESTs 2.9 

= : 109101 AW608930 Hs.52184 hypothetical protein FLJ2061 8 1.6 

H35 109112 AW419196 Hs.257924 hypothetical protein FLJ13782 3.2 

109124 AK000684 Hs.183887 hypothetical protein FU22104 1.7 

109139 AJ132592 Hs.59757 zinc finger protein 281 2.6 

109166 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkinesin 6) 2.9 

109198 BE566742 Hs.58169 highly expressed in cancer, rich in leucine heptad repeats 2.0 

40 109213 NM_016603Hs.82035 potential nuclear protein C50RF5; GAP-like protein 5.3 

109220 AW958181 Hs.189998 ESTs 5.7 

109233 AU077281 Hs.170285 nucleoporin 214kD (CAIN) 5.3 

109270 N99673 Hs.3585 ESTs, Weakly similar to AF126743 1 DNAJ domain-containing protein MCJ [H.sapiens] 1.4 

109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (from clone DKFZp586F1822) 2.9 

45 109313 AF153201 Hs.86276 C2H2 (Kruppel-type) zinc finger protein 1.3 

109341 AA213506 Hs.115099 EST 2.9 

109391 AL096858 Hs.184245 KIAA0929 protein Msx2 interacting nuclear target (MINT) homolog 1.5 

109420 H83603 Hs.40408 homeo box C9 2.2 

109426 N30531 Hs.42215 protein phosphatase 1, regulatory subunit 6 3.0 

50 109429 AI160029 Hs.61438 ESTs 1-9 

109445 AA232103 Hs.189915 ESTs 18 

109450 AB032969 Hs.173042 KIAA1 143 protein 3.7 

109468 NM_015310Hs.6763 KIAA0942 protein 3.2 

109478 AW074143 Hs.87134 ESTs 2.0 

55 109570 L40027 Hs.1 18890 glycogen synthase kinase 3 alpha 2.1 

109662 F02614 Hs.27319 ESTs 1.4 

109825 R71264 Hs.16798 ESTs 1.3 

110039 H11938 Hs.21907 histone acetyltransferase 2.0 

110056 AA503041 Hs.279009 matrix Gla protein 2.5 

60 110085 AA603840 Hs.29956 KIAA0460 protein 1.7 

110110 T07353 Hs.7948 ESTs 2.9 

110129 R51853 Hs.226429 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 1.7 

110154 NM_014521Hs.17667 SH3-domain binding protein 4 4.2 
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOCHROME P450 4A11 PRECURSOR [H.sapiens] 4.2 

65 110242 N41744 Hs.19978 CGI-30 protein 13 

110259 H28428 Hs.32406 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 2.2 

110312 BE256986 Hs.11896 hypothetical protein FLJ 12089 2.1 
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110330 
110501 
110504 
110525 
110568 
110699 
110705 
110742 
110761 
110762 
110765 
110769 
110799 
110805 
110813 
110820 
110840 
110844 
110854 
110856 
110885 
110897 
110915 
110918 
110958 
110963 
110981 
110984 
111125 
111132 
111164 
111172 
111174 
111179 
111184 
111184 
111189 
111216 
111221 
111223 
111239 
111285 
111299 
111312 
111318 
111337 
111352 
111370 
111384 
111389 
111391 
111392 
111452 
111486 
111549 
111585 
111627 
111870 
111937 
111944 
111987 
112134 
112244 
112388 
112456 
112464 
112506 



Hs.16621 

Hs.210859 
Hs.37430 



AI288666 
H55748 
H55915 
H57330 

AK001160 Hs.5999 

T97586 Hs. 18090 

AB007902 Hs.32168 

AW190338 Hs.28029 

AL138077 Hs.16157 

BE044245 Hs.30011 

AK000322 Hs.18457 

BE000831 Hs.23837 

AI089660 Hs.323401 

T25829 Hs.24048 

AA767373 Hs.35669 



R33261 
N31598 
AI740792 



Hs.6614 

Hs.12727 

Hs.167531 



BE612992 Hs.27931 
M992380 

BE384447 Hs.16034 

AL1 17430 Hs.6880 

BE092285 Hs.29724 

H04360 Hs.24283 
NM_005864Hs.24587 

AK002180 Hs.11449 

AK001980 Hs.24284 

AW613287 Hs.80120 

N63823 Hs.269115 

AB037807 Hs.83293 

N46180 Hs.122489 



R67419 
AL050166 



Hs.21851 
Hs.26295 



AK000136 Hs.10760 
AI815486 Hs.243901 



AI815486 
N67603 



Hs.243901 
Hs.272130 



AW139408 Hs.152940 
AB037782 Hs.15119 
AA852773 Hs.334838 
N90956 Hs.17230 
AA778711 Hs.4310 
AB033091 Hs.74313 



AI523913 
T99755 



Hs.34504 
Hs.334728 



AA837396 Hs.263925 
H58589 Hs.35156 



AI478658 
N94606 



Hs.94631 
Hs.288969 



AK000987 Hs.169111 
NM_003896Hs.225939 
W46342 Hs.325081 



R02354 

AI051194 

W90638 

R10720 

R52656 



Hs.15999 

Hs.227978 

Hs.20321 

Hs.20670 

Hs.21691 



AB037834 Hs.18685 
BE298665 Hs.14846 
AW083791 Hs.21263 
NM_015310Hs.6763 
R41823 Hs.7413 
AB029000 Hs.70823 
R46071 Hs.301693 
NM_016248Hs.232076 
AW007287 Hs.28538 
AI742756 Hs.26079 



DKFZP434I1 16 protein 6.2 

gb:yq94a01.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:203400 3' 6.1 

hypothetical protein FLJ1 1016 6.1 

EST 6.3 

hypothetical protein FLJ10298 1 .3 

ESTs 1.8 

KIAA0442 protein 1.6 

hypothetical protein MGC1 1256 7.6 

hypothetical protein FLJ 12707 2.5 

hypothetical protein MGC2963 9.3 

hypothetical protein FU20315 5.5 

Homo sapiens cDNA FLJ11812 fis, clone HEMBA1006364 2.1 

dpy-30-like protein 1.5 

FK506 binding protein precursor 6.6 
ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 5.7 

ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal [H.sapiens] 3.4 

hypothetical protein FLJ21 610 1 .7 

methylcrotonoyl-Coenzyme A carboxylase 2 (beta) 1 7 

hypothetical protein FU10607 similar to glucosamine-phosphate N-acetyltransferase 4.7 

gb:ot37g06.s1 Soares_testis_NHT Homo sapiens cDNA clone 3' similar to contains element 2.3 

hypothetical protein MGC1 31 86 3.5 

DKFZP434D156 protein 2.2 

hypothetical protein FLJ13187 2.6 

ESTs, Moderately similar to reduced expression in cancer [H.sapiens] 1.9 

signal transduction protein (SH3 containing) 6.7 

DKFZP56401 23 protein 2.0 

ADP-ribosyltransferase {NAD+; poly(ADP-ribose) polymerase)-like 2 1 .3 
UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 1 (GalNAc-T1) 1.8 

ESTs, Moderately similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H.sapiens] 3.6 

hypothetical protein 2.1 

Homo sapiens cDNA FLJ13289 fis, clone OVARC1001170 2.3 

Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321 3.7 

Homo sapiens mRNA; cDNA DKFZp586D1 1 22 (from clone DKFZp586D1 1 22) 7.5 

asporin (LRR class 1) 7.1 

Homo sapiens cDNA FLJ20738 fis, clone HEP08257 6.7 

Homo sapiens cDNA FLJ20738 fis, clone HE 3.3 

ESTs, Weakly similar to S65824 reverse transcriptase homolog [H.sapiens] 3.6 

ESTs 1-5 

KIAA1361 protein 2.6 

KIAA1866 protein 4.6 

hypothetical protein FLJ22087 7.9 

eukaryotic translation initiation factor 1A 6.9 

KIAA1265 protein 5.0 

ESTs 3.8 

ESTs 1-2 

LIS1 -interacting protein NUDE 1 , rat homolog 5. 1 

Homo sapiens cDNA FLJ1 1027 fis, clone PLACE10041 14 2.2 

brefeldin A-inhibited guanine nucleotide-exchange protein 1 2.8 

HSCARG protein 2.2 

oxidation resistance 1 2.1 

sialyltransferase 9 (CMP-NeuAc:lactosylceramide alpha-2,3-sialyltransferase; GM3 synthase) 5.1 

Homo sapiens, clone IMAGE:3659680, mRNA, partial cds 8.4 

ESTs 2.7 

EST 6.5 

ESTs, Moderately similar to ZRF1_HUMAN ZUOTIN RELATED FACTOR-1 (M-PHASE 1 .4 

EST 16 

ESTs 1.6 

Homo sapiens mRNA for KIAA1413 protein, partial cds 2.4 

Homo sapiens mRNA; cDNA DKFZp564D01 6 (from clone DKFZp564D01 6) 1 0.6 

suppressor of potassium transport defect 3 6.6 

KIAA0942 protein 5.1 

ESTs; calsyntenin-2 2.8 

KIAA1077 protein 14.6 

Homo sapiens, clone IMAGE:3638994, mRNA, partial cds 9.0 

A kinase (PRKA) anchor protein 11 1 .4 

Homo sapiens cDNA: FLJ21086 fis, clone CAS03272 1 .4 

ESTs 3.2 
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112513 R68425 Hs.13809 hypothetical protein FLJ 10648 2.0 

112752 AK001635 Hs.14838 hypothetical protein FLJ10773 1.8 

112884 AK000004 Hs.5013 Homo sapiens mRNA for FU00004 protein, partial cds 6.6 

112923 T10258 Hs.5037 EST 1.5 

112936 AW970826 Hs.6185 KIAA1557 protein 3.2 

112958 R61388 Hs.6724 ESTs 6.0 

112966 Z44718 Hs.1 02548 glucocorticoid receptor DNA binding factor 1 6.4 

112978 AK000272 Hs.7099 hypothetical protein FLJ20265 1.2 

112995 AA737033 Hs.7155 ESTs, Moderately similar to 21 15357A TYKi protein [M.musculus] 5.6 

112996 BE276112 Hs.7165 zinc finger protein 259 2.0 
113047 AI571940 Hs.7549 ESTs 1.9 
113049 AW965190 Hs.7560 Homo sapiens mRNA for KIAA1729 protein, partial cds 2.4 
113089 T40707 Hs.270862 ESTs 1.3 
1 13196 T57317 gb:yb51a03.s1 Stratagene fetal spleen (937205) Homo sapiens cDNA clone IMAGE:74668 3', 1 .7 
113248 T63857 gb:yc16e01.s1 Stratagene lung (937210) Homo sapiens cDNA clone 3', mRNA sequence 2.8 
113254 AK002180 Hs.1 1449 DKFZP56401 23 protein 1.3 
113277 AW971049 Hs.1 1774 protein (peptidyl-prolyl cis/trans isomerase) NIMA-interacting, 4 (parvulin) 3.2 
113429 AA688021 Hs.1 79808 ESTs * 1.2 
113499 AI467908 Hs.8882 ESTs 5.9 
113547 H59588 Hs.1 5233 ESTs 2.0 
113554 AW503990 Hs.1 42442 HP1-BP74 3.6 
113647 AA813887 Hs.188173 Homo sapiens cDNA FLJ12187 fis, clone MAMMA1000831 1.3 
113702 T97307 gb:ye53h05.s1 Scares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:121497 3', 4.4 
113722 AV653556 Hs.184411 albumin 1.3 
113759 AW499665 Hs.9456 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 5 1.2 
113777 BE266947 Hs.10590 zinc finger protein 313 13.4 
113783 AL359588 Hs.7041 hypothetical protein DKFZp762B226 1.7 
113791 AI269096 Hs.135578 chitobiase, di-N-acetyl- 1.3 
113808 W44735 Hs.9286 Homo sapiens cDNA: FLJ21278 fis, clone COL01 832 3.3 
113811 BE207480 Hs.6994 Homo sapiens cDNA: FLJ22044 fis, clone HEP09141 3.1 
113817 H13325 Hs.332795 hypothetical protein DKFZp761017121 3.2 
113826 AW378212 Hs.24809 hypothetical protein FLJ 10826 2.3 
113834 T26483 Hs.6059 EGF-containing fibulin-like extracellular matrix protein 2 11.3 
113868 W57902 Hs.90744 proteasome (prosome, macropain) 26S subunit, non-ATPase, 11 2.7 
113870 AL079314 Hs.16537 hypothetical protein, similar to (U06944) PRAJA1 6.1 
113885 AW959486 Hs.21732 ESTs 6.6 

1 13923 AW953484 Hs.3849 hypothetical protein FLJ22041 similar to FK506 binding proteins 1 .9 

113989 W87544 Hs.268828 ESTs 1.2 

114022 AI539519 Hs.120969 Homo sapiens CDNAFLJ1 1562 fis, clone HEMBA1 0031 97 5.4 

114030 AI825386 Hs.164478 hypothetical protein FU21 939 similar to 5-azacytidine induced gene 2 9.4 

114060 AB029551 Hs.7910 RING1 and YY1 binding protein 1.8 

114196 AF017445 Hs.150926 fucose-1 -phosphate guanylyltransferase 1.5 

114226 AB028968 Hs.7989 KIAA1045 protein 1.8 

114253 BE149866 Hs.14831 Homo sapiens, Similar to zinc finger protein 136 (clone pHZ-20), clone MGC:10647, mRNA, complete cds 2.3 

114262 AL117518 Hs.3686 KIAA0978 protein 1.4 

114275 AW515443 Hs.306117 KIAA0306 protein 15.8 

114292 AI815395 Hs.184641 fatty acid desaturase 2 1.9 

114309 AA332453 Hs.20824 CGI-85 protein 2.4 

114392 AA249590 Hs.100748 ESTs, Weakly similartoA28996 proline-rich protein M14 precursor -mouse [M.musculus] 1.8 

114407 BE539976 Hs.103305 Homo sapiens mRNA; cDNA DKFZp434B0425 (from clone DKFZp434B0425) 1.2 

114455 H37908 Hs.271616 ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE 5.5 

114463 AL120247 Hs.40109 KIAA0872 protein 5.2 

114464 AI091713 Hs.106597 Homo sapiens, SimilartoRIKENcDNA 1110012M11 gene, clone IMAGE:3688605, mRNA, partial cds 1 .2 
114471 AA028074 Hs.104613 RP42homolog 1.8 
114480 BE066778 Hs.151678 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 6 (GalNAc-T6) 13.4 
114671 AA766268 Hs.266273 hypothetical protein FLJ13346 1.9 
114698 AA476966 Hs.1 10857 polymerase (RNA) 111 (DNA directed) polypeptide K (12.3 kDa) 3.5 
114730 AI373544 Hs.331328 intermediate filament protein syncoilin 3.8 
114767 AI859865 Hs.154443 minichromosome maintenance deficient (S.cerevisiae) 4 1.6 
114774 AV656017 Hs.184325 CGI-76 protein 3.1 
114798 AA1 59181 Hs.54900 serologically defined colon cancer antigen 1 3.5 
114860 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 4.3 

114895 AA236177 Hs.76591 KIAA0887 protein 7.1 

114896 BE539101 Hs.5324 hypothetical protein 1.3 

114911 AA236672 gb:zt29f02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:723771 3', mRNA sequence. 1.5 

114930 AA237022 Hs.188717 ESTs 2.0 

114938 AA242834 Hs.58384 ESTs 2.9 
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114965 AI733881 Hs.72472 BMP-R1B 2.3 

115023 AF102546 Hs.63931 dachshund (Drosophila) homolog 1.3 

115038 AA252360 Hs.87968 toll-like receptor 9 1.6 

115061 AI751438 Hs.41271 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 1913076 11.8 

5 115062 AA253314 Hs.154103 LIM protein (similar to rat protein kina 1.5 

115117 AI670847 Hs.5324 hypothetical protein 1.5 

115121 AI634549 Hs.88155 ESTs 2.8 

115206 AW183695 Hs.186572 ESTs 2.5 

1 15221 AW365434 Hs.79741 hypothetical protein FLJ101 16 1 .5 

10 115239 BE251328 Hs.73291 hypothetical protein FLJ 10881 1.3 

115242 AI368236 Hs.283732 ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE 1.4 

115278 AK002163 Hs.301724 hypothetical protein FLJ 11 301 1.5 

115285 AW972872 Hs.293736 ESTs 2.4 

115291 BE545072 Hs.122579 hypothetical protein FLJ 10461 6.2 

15 115400 AI215069 Hs.89113 ESTs 6.6 

U> 115468 AA314349 Hs.48499 tumor antigen SLP-«p 7.4 

r~: 115471 AK001376 Hs.59346 hypothetical protein FU1 051 4 1.4 

il 115479 AW301608 Hs.278188 ESTs, Moderately similar to I54374 gene NF2 protein [H.sapiens] 4.0 

115496 AW247593 Hs.71819 eukaryotic translation initiation factor 4E binding protein 1 16.3 

U20 115500 Y14443 Hs.88219 zinc finger protein 200 5.0 

===•=: 115553 AJ275986 Hs.71414 transcription factor (SMIF gene) 2.5 

!?: 115581 AI540842 Hs.61082 ESTs 6.1 

W 115587 BE081342 Hs.283037 HSPC039 protein 2.9 

K\ 115590 AA399477 Hs.67896 7-60 protein 5.3 

=25 115646 N36110 Hs.305971 solute carrier family 2 (facilitated glucose transporter), member 10 4.7 

^ 115652 BE093589 Hs.38178 hypothetical protein FLJ23468 10.6 

£ 115655 AL048269 Hs.288544 Homo sapiens, clone MGC: 16063, mRNA, complete cds 12.7 

f=; 115663 AI138785 Hs.40507 ESTs 2.0 

r 115676 AA953006 Hs.88143 ESTs 3.0 

rSO 115690 AA625132 Hs.44159 hypothetical protein FLJ21615 1.7 

ffj 115693 AF231023 Hs.55173 cadherin, EGF LAG seven-pass G-type receptor 3, flamingo (Drosophila) homolog 6.8 

r : 115715 BE395161 Hs.1390 proteasome (prosome, macropain) subunit beta type, 2 1.7 

J." 115734 AI950339 Hs.40782 ESTs 2.6 

W 115811 NM_015434Hs.48604 DKFZP434B1 68 protein 2.1 

fS5 115823 AI732742 Hs.87440 ESTs 2.1 

115837 AI675217 Hs.42761 ESTs 1.3 

115844 AI373062 Hs.332938 hypothetical protein MGC5370 4.4 

115866 AW062629 Hs.52081 KIAA0867 protein 7.2 

115875 N55669 Hs.333823 mitochondrial ribosomal protein L13 1.2 

40 115941 AI867451 Hs.46679 hypothetical protein FLJ20739 5.5 

115968 AB037753 Hs.62767 KIAA1 332 protein 9.8 

116003 BE275469 Hs.66493 Down syndrome critical region gene 5 1.4 

116011 AL359053 Hs.57664 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 2005735 2.4 

116108 AA770688 Hs.28777 H2A histone family, member L 1.8 

45 116134 BE243834 Hs.50441 CGI-04 protein 1.4 

1 16189 N35719 Hs.44749 ESTs, Moderately similar to T00358 hypothetical protein KIAA0684 [H.sapiens] 1 .2 

1 16195 AW821 113 Hs.72402 ESTs 2.1 

116238 AV660717 Hs.47144 DKFZP586N0819 protein 1.7 

116246 AF265555 Hs.250646 baculoviral IAP repeat-containing 6 1.7 

50 116262 AI936442 Hs.59838 hypothetical protein FLJ 10808 1.7 

116298 AI955411 Hs.94109 Homo sapiens cDNA FLJ 1 3634 fis, clone PLACE101 1133 1.9 

116318 AF097645 Hs.58570 deleted in cancer 1; RNA helicase HDB/DICE1 4.9 

116325 AI472106 Hs.49303 Homo sapiens cDNAFLM 1663 fis, clone HEMBA1004631 1.4 

116336 AL133033 Hs.4084 KIAA1025 protein 1.9 

55 116339 AK000290 Hs.44033 dipeptidyl peptidase 8 1.5 

116350 AA497129 Hs.184771 nuclearfactorl/C (CCAAT-binding transcription factor) 1.9 

116358 AI149586 Hs.38125 interferon-induced protein 75, 52kD 1.9 

116365 N50174 Hs.46765 ESTs 6.1 

116368 N90466 Hs.71109 KIAA1 229 protein 1.6 

60 116417 AW499664 Hs. 12484 Human clone 23826 mRNA sequence 7.4 

116436 AA161411 Hs.58668 chromosome 21 open reading frame 57 2.1 

116462 AF218313 Hs.236828 putative helicase RUVBL 1.5 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 2.1 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 1.2 

65 116575 AA312572 Hs.6241 phosphoinositide-3-kinase, regulatory subunit, polypeptide 1 (p85 alpha) 1.5 

116637 AK001043 Hs.92033 integrin-linked kinase-associated serine/threonine phosphatase 2C 2.7 

116640 X89984 Hs.211563 B-cell CLL/lymphoma 7A 2.3 
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116700 AI800202 Hs.317589 hypothetical protein MGC10765 1.4 

116705 AW074819 Hs.12313 hypothetical protein FLJ14566 3.4 

116732 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 2.9 

116921 AW068115 Hs.821 biglycan 8.3 

5 116926 H73608 Hs.290830 ESTs 1.7 

117034 U72209 Hs. 1 80324 YY1 -associated factor 2 3.4 

117132 AI393666 Hs.42315 p10-binding protein 5.2 

117247 N21032 gb:yx46(06.s1 Soares melanocyte 2NbHM Homo sapiens cDNA done IMAGE:264803 3', mRNA sequence. 5.5 

117276 N71183 Hs.121806 HomosapienscDNAFU11971 fis, clone HEMBB1001208 1.5 
10 117284 AK001701 Hs.183779 Homo sapiens cDNA FLJ10590 fis, clone NT2RP2004392, weakly similar to MNN4 PROTEIN 2.0 

117367 AI041793 Hs.42502 ESTs 2.0 

117368 AI878942 Hs.90336 ATPase, H+transporting, lysosomal (vacuolar proton pump), member J 2.1 
117382 AF150275 Hs.40173 ESTs 2.7 
117412 N32536 Hs.42645 solute carrier family 16 (monocarboxylic acid transporters), member 6 1.4 

15 117557 AF123050 Hs.44532 diubiquitin 3.4 

117588 N34895 Hs.44648 ESTs 3.4 

M: 117745 BE294925 Hs.46680 CGI-1 2 protein 3.0 

}*>, 117754 AA121673 Hs.59757 zinc finger protein 281 1.9 

117879 N54706 Hs.303025 chromosome 11 open reading frame 24 1.8 

C20 117881 AF161470 Hs.260622 butyrate-induced transcript 1 5.7 

if! 117904 BE540675 Hs.332938 hypothetical protein MGC5370 5.9 

117911 AL137379 Hs.47125 hypothetical protein FLJ13912 1.7 

!? : 117933 Y10518 Hs.116470 hypothetical protein FLJ20048 1.7 

fU 117983 AL110246 Hs.47367 KIAA1 785 protein 5.4 

%25 118078 N54321 Hs.47790 EST 5.2 

3 -»; 118301 AA453902 Hs.293264 ESTs 2.6 

W 118429 AA243332 Hs.74649 cytochrome c oxidase subunit Vic 2.5 

s 118472 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 4.1 

f=- 118488 AJ277275 Hs.50102 rapa-2 (rapa gene) 1.2 

fSO 118509 N22617 Hs.43228 Homo sapiens cDNA FLJ11835 fis, done HEMBA1006595 1.5 

f~' 118528 AI949952 Hs.49397 ESTs 7.4 

I1.J 118656 AI458020 Hs.293287 ESTs 2.5 

== 118670 AA332845 Hs.152618 ESTs, Moderately similar to ZN91_HUMAN ZINC FINGER PROTEIN 91 [H.sapiens] 1.2 

T 118698 AB033113 Hs.50187 KIAA1 287 protein 2.1 

1=35 118737 AA199686 gb:zq75g09.M Strategene hNT neuron (937233) Homo sapiens cDNA clone IMAGE;647488 5' 5.2 
fl i 118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.4 
118984 AI668709 Hs.240722 ESTs, Moderately similar to ALU8JHUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 3.6 

118986 AF148713 Hs.125830 bladder cancer overexpressed protein 4.8 

119206 W24781 Hs.293798 KtAA1710 protein 1.7 

40 119235 AW453069 Hs.3657 activity-dependent neuroprotective protein 2.2 

119235 AW453069 Hs.3657 activity-dependent neuroprotective prote 1.6 

119265 BE539706 Hs.285363 ESTs 1.4 

119279 N57568 Hs.48028 EST 25.1 

119298 NM_001241Hs.1 55478 cyclin T2 1.6 

45 119338 AI417240 Hs.320836 ESTs, Weakly similar to A47582B-cell growth factor precursor [H.sapiens] 1.3 

119349 T65004 Hs.163561 ESTs 8.4 

119403 AL117554 Hs.119908 nucleolar protein NOP5/NOP58 6.7 

119478 AI624342 Hs.170042 ESTs 2.4 

119486 AI796730 Hs.55513 ESTs 2.1 

50 119513 W37933 Empirically selected from AFFX single probeset 1.9 

119601 AK000155 Hs.91684 Homo sapiens mRNA; cDNADKFZp667H03 (from clone DKFZp667l103) 3.7 

119602 AW675298 Hs.233694 hypothetical protein FLJ1 1350 3.0 
119676 AA243837 Hs.57787 ESTs 1.4 
119682 W61019 Hs.57811 ESTs , 1.2 

55 119774 AB032977 Hs.6298 KIAA1 151 protein 1.8 

119780 NM 016625HS.191381 hypothetical protein 3.1 

119789 BE393948 Hs.50915 kallikrein 5 (KLK5; KLK-L2; stratum comeum tryptic enzyme) 9.2 
119805 AJ223810 Hs.43213 ESTs, Weakly similar to IEFS_HUMAN TRANSFORMATION-SENSITIVE PROTEIN IEFSSP 3.6 

119818 AA1 30970 Hs.58382 hypothetical protein FU1 1101 2.5 

60 119863 AA081218 Hs.58608 Homo sapiens CDNAFLJ14206 fis, clone NT2RP3003157 2.7 

1 19905 AW449064 Hs.1 19571 collagen, type III, alpha 1 (Ehlers-Danlos syndrome type IV, autosomal dominant) 2.6 

119966 AA703129 Hs.58963 ESTs 2.7 

120132 W57554 Hs.125019 lymphoid nudear protein (LAF-4) mRNA 1.2 

120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7 

65 120248 AI924294 Hs.173259 uncharacterized bone marrow protein BM033 1.2 

120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9 

120269 AW131940 Hs.104030 ESTs 9.6 
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120274 AA177051 gb:nc02a02.s1 NCI_CGAP_Pr3 Homo sapiens cDNA clone 1MAGE:194 similar to contains Alu 4.6 

120280 AA190577 gb:zp52g02.s1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA clone 3', mRNA sequence 2.0 

120296 AW995911 Hs.299883 hypothetical protein FLJ23399 1.8 

120297 M191384 Hs.104072 ESTs, Weakly similartoZ195_HUMAN ZINC FINGER PROTEIN 195 [H.sapiens] 15.2 

120324 M195517 Hs.191643 ESTs 5.5 

120325 AA195651 Hs.104106 ESTs 6.4 
120327 AK000292 Hs.278732 hypothetical protein FLJ20285 16.1 
120336 N85785 Hs.181 165 eukaryotic translation elongation factor 1 alpha 1 2.9 
120342 AW450669 Hs.45068 hypothetical protein DKFZp434l1 43 5.7 
120345 AA210722 Hs.104158 ESTs 4.5 
120349 AW969481 Hs.55189 hypothetical protein 16.8 
120352 R06859 Hs.193172 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 5.0 
120356 AF000545 Hs.296433 putative purinengic receptor 28.1 
120371 AA219305 Hs.104196 EST 12.4 

120382 AA228026 Hs.38774 ESTs 4.0 

120383 AL109963 Hs.1 231 22 FSH primary response (LRPR1, rat) homolog 1 9.7 
120386 AW969665 Hs.1 54848 hypothetical protein DKFZp434D01 27 32.6 

120388 AA232874 Hs.1 04245 ESTs 3.1 

1 20389 AW967985 Hs.325572 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 21 .7 
120396 AA134006 Hs.79306 eukaryotic translation initiation factor 4E 12.5 
120404 AB023230 Hs.96427 KIAA1013 protein 7.2 
120418 AW966893 Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F1323 (from clone DKFZp586F1323) 11.4 
120423 AA236453 Hs.18978 Homo sapiens cDNA: FLJ22822 fis, clone KAIA3968 1.9 

1 20472 AI950087 gb:wq05c02.x1 NCI_CGAP_Kid1 2 Homo sapiens cDNA clone 3', mRNA sequence 1 9.4 

120473 AA251973 Hs.269988 ESTs 5.4 
120484 AA253170 Hs.96473 EST 10.4 
120504 AA256837 gb:zr84d10.s1 Soares_NhHMPu_S1 Homo sapiens cDNA clone IMAGE:682387 3', mRNA sequence. 3.9 
120509 BE047718 Hs.96545 ESTs 9.4 
120520 AA258601 Hs.161731 EST 2.4 
120535 BE350244 Hs.96547 ESTs 2.5 
120551 AA279160 Hs.111407 Homo sapiens, clone IMAGE:361 3029, mRNA, partial cds 5.2 
120570 AA280679 Hs.271445 ESTs, Weakly similarto ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 14.4 
120582 BE244830 Hs.284228 ZNF135-like protein 10.2 
120590 AW372799 Hs.125790 leucine-rich repeat-containing 2 2.1 
120596 AA282074 Hs.237323 N-acetylglucosamine-phosphate mutase 7.5 
120619 AW965339 Hs.111471 ESTs 2.5 
120624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52.0 
120639 AA286942 gb:zs56f05.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:701505 3' similar to contains Alu2.4 
120648 AA287095 Hs.140309 Homo sapiens, clone IMAGE:3677194, mRNA, partial cds 5.0 
120653 AW063659 Hs.191649 ESTs 2.2 

120668 AW969638 Hs.112318 6.2 kd protein 2.2 

120669 BE536739 Hs.109909 ESTs 1.9 

120695 AA976503 gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens cDNA clone 3' similarto contains PTR7.H PTR7 46.8 

120696 AI821539 Hs.97249 ESTs 2.5 
120713 AW449855 Hs.96557 Homo sapiens cDNA FU1 2727 fis, clone NT2RP2000027 5.9 
120718 AA292747 Hs.97296 ESTs 2.9 
120750 AI191410 Hs.96693 ESTs, Moderately similar to 2109260AB cell growth factor [H.sapiens] 7.0 
120774 AI608909 Hs.193985 ESTs 7.8 
120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1848 protein 6.8 
120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens cDNA 3' end similar to EST containing O family repeat, 4.4 
120938 AA386260 Hs.104632 EST 4.4 
120977 AA398155 Hs.97600 ESTs 4.4 

120984 BE262951 Hs.99052 ESTs 5.6 

120985 AI219896 Hs.97592 ESTs 12 
121011 AA398360 Hs.97608 EST 3.1 
121026 AI439713 Hs.165295 ESTs 3.5 
121081 AA398721 Hs.186749 ESTs, Highly similarto I37550 mismatch repair protein MSH2 [H.sapiens] 5.4 
121133 AA363307 Hs.97032 ESTs 3.7 
121176 AL121523 Hs.97774 ESTs 1.7 
121223 AI002110 Hs.97169 ESTs, Weakly similar to dJ667H12.2.1 [H.sapiens] 2.9 
121320 AA403008 Hs.301927 c6.1A 1.9 
121340 AW956981 Hs.97910 Homo sapiens CDNAFLJ13383 fis, clone PLACE1001024 3.5 
121408 AA406137 Hs.98019 EST 6.0 
121439 AA410190 Hs.98076 ESTs, Weakly similar to A47582B-cell growth factor precursor [H.sapiens] 7.4 
121450 AA406430 Hs.105362 Homo sapiens, clone MGC:18257, mRNA, complete cds 6.9 
121452 AW971063 Hs.292882 ESTs 1.8 
121455 H58306 Hs.15165 retinoic acid induced 14 10.5 
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121457 W07404 Hs.144502 hypothetical protein FU22055 3.4 

121496 AA442224 Hs.97900 ESTs 14.4 

121505 AA494172 Hs.194417 ESTs 13.1 

121508 M402515 Hs.97887 ESTs 28.0 

5 121513 AA416653 Hs.181510 ESTs 6.2 

121514 M412112 gb:zt69b02.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:727563 3', mRNA sequence. 2.6 

121549 AA412477 Hs.98142 EST 7.4 

121558 AA412497 gb:zt95g12.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:730150 3' similarto contains2.8 

121577 AA411970 Hs.98096 EST 3.5 

10 121581 AA416568 gb:zu05c10.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence 6.1 

121589 AD001528 Hs.89718 spermine synthase 3.9 

121594 AA626010 Hs.98247 ESTs 2.2 

121622 AA416931 Hs.126065 ESTs 4.2 

121655 M421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (from clone DKFZp434B1023) 7.8 

15 121682 M418160 Hs.86043 Homo sapiens cDNAFLJ13558fis, clone PLACE1007743 2.0 

121690 AV660305 Hs.110286 ESTs 4.7 

121706 U55184 Hs.154145 hypothetical protein FLJ 11 585 12.7 

IT 121714 AA419225 Hs.98269 Homo sapiens cDNA FLJ1 1953 fis, clone HEMBB1000883 8.1 

D 121729 AI949597 Hs.98325 ESTs 1.8 

j=20 121731 AA421041 Hs.180744 ESTs 4.0 

rtl 121744 AA398784 Hs.97514 ESTs 7.1 

121748 BE536911 Hs.234545 hypothetical protein NUF2R 19.5 

ED 121773 AB033022 Hs.1 58654 KIAA1 196 protein 7.9 

n i 121775 M421773 Hs.161008 ESTs 1.7 

s '25 121776 AA292579 Hs.125133 hypothetical protein FLJ22501 6.6 

^ 121786 AI810774 Hs.98376 ESTs 10.5 

Pi 121832 AW340797 Hs.98434 ESTs 5.8 

121836 AA328348 Hs.218289 ESTs 3.8 

* 121839 AA425691 Hs.191606 ESTs, Highly similarto KIAA1 048 protein [H.sapiens] 5.0 

DO 121842 AF027406 Hs.104865 serine/threonine kinase 23 2.7 

y. 121847 AA446628 Hs.2799 cartilage linking protein 1 2.3 

L ; 121871 AW972668 Hs.293044 ESTs 2.9 

W 121882 AA426376 Hs.98459 ESTs 5.0 

JC; 121911 AA427950 gb:zw50f02.s1 Soares_total_fetus_Nb2HF8_9w Homo sapiens cDNA clone IMAGE:773499 3' 7.2 

r»35 121915 AA428179 Hs.223405 ESTs, Moderately similar to A46010X-linked retinopathy protein [H.sapiens] 2.5 

!?: 121935 AA428647 Hs.98611 EST 2.3 

W 121983 AA298760 Hs.180191 hypothetical protein FLJ14904 3.4 

121985 AI862570 Hs.299214 Homo sapiens, clone IMAGE:2822295, mRNA, partial cds 11.4 

121995 AA210863 Hs.3532 nemo-like kinase 3.8 

40 121999 AA430211 Hs.98668 EST 6.4 

122009 AW292763 Hs.160822 Homo sapiens cDN A: FLJ20863fis, clone ADKA01 804 2.2 

122013 AA431085 Hs.98706 ESTs 6.5 

122036 W92142 Hs.271963 ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 13.1 

122050 AI453076 Hs.166109 ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 9.1 

45 122060 AA431738 Hs.98750 EST 13.1 

122114 AW161023 Hs.104921 ESTs 1.5 

122188 AA398838 gb:zt80d01.r1 Soares_testis_NHT Homo sapiens cDNA clone 5', mRNA sequence 3.3 

122204 AA435936 Hs.98842 EST 5.6 

122246 AA329550 Hs.29417 HCF-binding transcription factor Zhangfei 5.1 

50 122257 AA436819 Hs.98899 ESTs 5.6 

122302 AA441801 Hs.104947 ESTs 5.8 

122341 AW601969 Hs.99010 hypothetical protein FLJ22263 similarto nuclear receptor-binding SET-domain protein 1 2.0 

122356 AA443794 Hs.98390 ESTs 7.3 

122369 AA443985 Hs.303222 ESTs 12.2 

55 122371 AA868555 Hs.178222 ESTs 5.0 

122372 AA446008 Hs.336677 EST 7.6 

122378 AB032948 Hs.21356 hypothetical protein DKFZp762K2015 2.5 

122405 AA446572 Hs.303223 EST 2.8 

122412 AA446869 Hs.119316 ESTs 7.3 

60 122415 AA446918 Hs.99088 EST 1.9 

122418 AA446966 Hs.99090 ESTs, Moderately similar to similar to KIAA0766 [H.sapiens] 6.8 

1 22440 AW505139 Hs.9460 Homo sapiens mRNA; cDNA DKFZp547C244 (from clone DKFZp547C244) 2.6 

122446 AA447603 Hs.99123 EST 1.8 

122448 AA447626 Hs.99127 EST 3.5 

65 122458 AI266159 Hs.104980 ESTs 1.5 

122460 AW418788 Hs.99148 ESTs, Weakly similar to S43569 R01H10.6 protein - Caenorhabditis elegans [C.elegans] 9.7 

122464 AA448158 Hs.99152 EST 4.8 
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122490 AA448349 Hs.238151 EST 6.1 

122492 M448417 Hs.104990 ESTs 5.4 

122502 AA204969 Hs.234863 Homo sapiens cDNAFLJ12082fis, clone HEMBB1002492 1.3 

122510 AA449232 Hs.99195 ESTs 11.2 

5 122530 AW959741 Hs.40368 adaptor-related protein complex 1,sigma 2 subunit 10.1 

122547 M779725 Hs.164589 ESTs 2.5 

122555 AA194055 Hs.293858 ESTs 1.9 

122570 AA452578 Hs.262907 ESTs 9.5 

122572 AA452601 Hs.99287 EST 11.0 

10 122586 AK001910 Hs.99303 Homo sapiens cDNA FLJ1 1048 fis, clone PLACE1004516 3.4 

122587 AB040893 Hs.6968 KIAA1460 protein 2.0 

122598 AI028173 Hs.99329 ESTs 1.7 

122599 AL355841 Hs.99330 hypothetical protein FLJ23588 4.4 
122602 AA411925 Hs.301960 ESTs 4.6 

15 122607 AA453518 Hs.98023 ESTs 61.5 

122614 M453630 Hs.99339 EST 10.7 

. . 122616 AA453638 Hs.161873 ESTs 107.3 

j==: 122617 A | 681535 hs.148135 serine/threonine kinase 33 121.4 

Ci 122618 AA453641 gb:zx48e06.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence 31.1 

=20 122622 AA453987 Hs.144802 ESTs 5.6 

;!:: 122717 AA456859 Hs.178358 ESTs 8.5 

U'i 122762 AI376875 Hs.105119 ESTs 10.4 

ff! 122829 AW204530 Hs.99500 ESTs 81.8 

122834 AA461492 Hs.99545 HomosapienscDNAFU10658 fis, clone NT2RP2006052 3.6 

^25 122836 AA460581 Hs.290996 ESTs 4.5 

Si 122837 AA461509 Hs.293565 ESTs, Weakly similar to putative p150 [H.sapiens] 2.7 

P; 122838 AA460584 Hs.334386 ESTs 75.3 

- 122854 AA600235 Hs.9625 N IMA (never in mitosis gene a)-related kinase 6 7.7 

2 122856 AI929374 Hs.75367 Srolike-adapter 5.8 

C30 122861 AA335721 Hs.119394 ESTs 1.3 

£], 122866 BE539656 Hs.283705 ESTs 4.1 

'J'. 122868 AF005216 Hs.1 15541 Janus kinase 2 (a protein tyrosine kinase) 5.3 

iU 122870 AW576312 Hs.318722 Homo sapiens cDNA: FLJ21766 fis, done COLF7179 9.9 

.1:: 122872 AW081394 Hs.97103 ESTs 5.3 

4-35 122879 AA769410 Hs.128654 ESTs 13.9 

122907 AA470074 Hs.169896 ESTs 11.5 

fU 122916 AA470140 Hs.229170 EST 1.7 

122981 AA478951 Hs.105629 ESTs 5.0 

123013 AW968324 Hs.17384 ESTs 15.4 

40 123016 AW338067 Hs.323231 Homo sapiens cDNAFUl 1946 fis, clone HEMBB1000709 2.8 

123034 AL359571 Hs.44054 ninein (GSK3B interacting protein) 8.7 

123072 AI382600 Hs.104308 ESTs, Weakly similar to KIAA1 395 protein [H.sapiens] 8.8 

123082 AA485360 Hs. 105661 ESTs 3.9 

123088 AI343652 Hs.105667 ESTs 3.8 

45 123110 AA486256 Hs.193510 EST 7.4 

123114 BE304942 Hs.265848 myomegalin 2.8 

123131 T52027 Hs.271795 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 2.4 

123132 AI061582 Hs.324179 Homo sapiens CDNAFLJ12371 fis, clone MAMMA1002434 15.6 
123136 AW451999 Hs.194024 ESTs 5.1 

50 123149 AI734179 Hs.105676 ESTs 23.8 

123152 AW601773 Hs.270259 ESTs 5.2 
123258 AA490929 Hs.105274 ESTs, Weakly similar to RMS1_HUMAN REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 [H.sapiens] 9.3 
123315 AA496369 gb:zv37d10.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:755827 3' similar to4.1 

123369 AA504757 Hs.105738 ESTs 6.9 

55 123394 AA731404 Hs.1 05510 ESTs 3.6 

123433 AW450922 Hs.112478 ESTs 3.7 

123466 AA599042 Hs.112503 EST 7.4 

123470 AW303285 Hs.303632 Human DNA sequence from clone RP11-110H4 on chromosome 5 Contains a pseudogene similar to 3.5 

123471 AB021644 Hs.197219 zinc finger protein 14(KOX6) 5.2 
60 123475 BE439553 Hs.250528 Homo sapiens, clone IMAGE:4098694, mRNA, partial cds 1.7 

123482 N95059 Hs.55098 ESTs 1.6 

123486 BE019072 Hs.334802 Homo sapiens CDNAFLJ14680 fis, clone NT2RP2004242, weakly similar to 2.4 

123508 AW380388 Hs.155546 KIAA1080 protein; Golgi-associated, gamma-adaptin ear containing, ARF-binding protein 2 2.2 

123615 AA609170 gb:af12a12.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence 7.8 

65 123619 AA602964 gb:no97c02.s1 NCI_CGAP_Pr2 Homo sapiens cDNA clone, mRNA sequence 2.8 

123658 AA609364 gb:zu71d09.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:743441 3' similar to contains Alu. 1.7 

123674 AI269609 Hs.105187 kinesin protein 9 gene 5.7 
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123735 NM_013241Hs.95231 FH1/FH2 domain-containing protein 10.0 

123738 AA609891 Hs.1 12777 EST 5.2 

123753 AA609955 Hs.234961 Huntingtin interacting protein E 30.6 

123804 AA620464 Hs.261915 EST, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] 2.1 

5 1 2381 1 AA620586 gb:ae60g05.s1 Stratagene lung carcinoma 93721 8 Homo sapiens cDNA clone IMAGE:951 320 3' 2.7 

123951 AB012922 Hs. 173043 metastasis-associated Mike 1 6.2 

123983 AJ272267 Hs.146178 choline dehydrogenase 4.4 

124001 L42542 Hs.75447 ralA binding protein 1 7.0 

124006 AI147155 Hs.270016 ESTs 8.1 

10 124070 AI950314 Hs.154762 HIV-1 rev binding protein 2 3.7 

124074 H05635 Hs.294030 topoisomerase-related function protein 4-2 1.2 

124178 BE463721 Hs.97101 putative G protein-coupled receptor 3.1 

124203 AA372796 Hs.269339 ESTs, Weakly similar to AF161356 1 HSPC093 [H.sapiens] 5.7 

124352 AA640891 Hs.102406 ESTs 3.1 

15 124375 D87454 Hs.192966 KIAA0265 protein 3.5 

124385 AI267847 gb:aq49a10.x1 Stanley Frontal NB pool 2 Homo sapiens cDNA clone similar to contains 57.1 

124390 AA317338 Hs.7535 COBW-like protein 2.8 

Jf : 124391 AF155099 Hs.279780 NY-REN-1 8 antigen 7.1 

G 124417 N34059 gb:yv28h09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 3' similar to contains Alu 3.3 

r=20 124428 H13540 Hs.82202 ribosomal protein L17 2.9 

124440 AA532519 Hs.129043 Human DNA sequence from clone 989H11 on chromosome 22q13.1-13.2. Contains partof a 7.8 

Wm 124466 R10084 Hs.113319 kinesin heavy chain member 2 2.6 

ff ! 124482 N53935 gb:yv59d09.s1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone 3', mRNA sequence 7.9 

si 124498 H79433 Hs.268997 ESTs 7.8 

5 "25 124515 AA669097 Hs.109370 ESTs 3.3 

W 124608 N71076 Hs.102800 ESTs, Weakly similar to neuronal thread protein AD7c-NTP [H.sapiens] 4.5 

r'i 124631 NM_014053Hs.270594 FLVCR protein 3.2 

124634 AI765123 Hs.143671 Homo sapiens cDNAFLJ13533fls, clone PLACE1006371 5.8 

3 124637 AA1 60474 Hs.75798 hypothetical protein 9.3 

G30 124642 AW968856 Hs.278569 sorting nexin 17 3.5 

L.=. 124649 N92593 Hs.313054 ESTs 6.1 

L , 124656 AW297702 Hs.102915 ESTs 8.3 

124661 R48170 Hs.78436 EphB1 5.6 

J: 124683 AA381661 Hs.119878 ESTs, Weakly similarto M3K9_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE 7.9 

?-35 124712 R09166 Hs.191148 ESTs 5.7 

!f: 124735 R22952 Hs.268685 ESTs 11.3 

W 124761 AA374756 Hs.93560 Homo sapiens mRNA for KIAA1 771 protein, partial cds 9.0 

124768 AW368528 Hs.100855 ESTs 8.1 

124775 R41772 Hs.100878 ESTs 4.9 

40 124777 R41933 Hs.140237 ESTs, Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE 2.8 

124788 R43543 Hs.100912 Homo sapiens cDNA: FLJ22726 fis, clone HSI15005 5.1 

124809 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full insert 4.2 

124811 R46068 Hs.288912 hypothetical protein FU22604 14.2 

124812 R47948 Hs.188732 ESTs 7.9 
45 124822 AA418160 Hs.86043 Homo sapiens cDNAFLJ13558 fis, clone PLACE1007743 6.6 

124825 AA501669 Hs.336693 ESTs 2.3 

124833 AW975868 Hs.294100 ESTs 2.7 

124857 R63652 Hs.137190 ESTs 2.3 

124860 R65763 Hs.101477 EST 23.9 

50 124863 AI382555 Hs.127950 bromodomain-containing 1 2.0 

124876 AF1 35422 Hs.27059 GDP-mannose pyrophosphorylaseA 4.4 

124878 BE397530 Hs.288057 hypothetical protein FLJ22242 2.7 

124902 H37941 Hs.101883 ESTs 5.7 

124903 AW296713 Hs.221441 ESTs 32.4 
55 124930 AI076343 Hs.173939 ESTs, Weakly similarto ALUB_HUMAN !l!l ALU CLASS B WARNING ENTRY ill [H.sapiens] 22.8 

124942 R99978 Hs.268892 ESTs, Moderately similarto B34087 hypothetical protein [H.sapiens] 6.1 

124958 AI078645 Hs.431 murine leukemia viral (bmi-1) oncogene homolog 1.9 

124980 T40841 Hs.98681 ESTs 4.5 

125002 T59338 Hs.269463 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 4.9 

60 125047 T79815 Hs.279793 ESTs 5.0 

125051 T79956 Hs.100588 EST 135.3 

125056 T81310 Hs.100592 ESTs 5.4 

125101 AI472068 Hs.286236 KIAA1 856 protein 5.6 

125113 T96595 Hs.302270 ESTs, Weakly similar to ALUF_HUMAN 111! ALU CLASS F WARNING ENTRY 111 [H.sapiens] 1.8 

65 125115 T97341 gb:ye57e05.s1 Soares fetal liverspleen 1NFLS Homo sapiens cDNA clone IMAGE:121856 3' similarto 9.6 

125125 AI222382 Hs.240767 Human DNA sequence from clone RP1-12G14 on chromosome 6q24.1-25.2. Contains the 5' end of the gene 1.5 

125147 W38150 Empirically selected from AFFX single probeset 1.7 
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125161 W44657 Hs.144232 EST 10.7 
125249 AA630863 Hs.131375 ESTs, Moderately similar to ALUB_HUMAN !!!! ALU CLASS B WARNING ENTRY !!! [H.sapiens] 1.3 

125255 AF098162 Hs.118631 timeless (Drosophila) homolog 9.4 

125279 AW401 809 Hs.4779 KIAA1 150 protein 1.5 

5 125280 AI123705 Hs.106932 ESTs 8.0 

125298 AW972542 Hs.289008 Homo sapiens cDNA: FLJ21814 fis, clone HEP01068 1.5 

125660 AW292171 Hs.23978 scaffold attachment factor B 5.9 

125827 NM_003403Hs.97496 YY1 transcription factor 1.2 

125891 U29589 Hs.7138 cholinergic receptor, muscarinic 3 6.4 

10 126005 AW409701 Hs.1578 baculoviral IAP repeat-containing 5 (survivin) 14.3 

126202 AA157632 Hs.272630 vacuolar proton pump delta polypeptide 2.4 

126695 AA643322 Hs.172028 adisintegrinandmetalloproteinasedomainIO 9.1 

127050 AW411066 Hs.274351 CGI-89 protein 17.0 

127274 AW966158 Hs.58582 Homo sapiens cDNAFLJ12789fis, clone NT2RP2001947 12.8 

15 128355 AW293012 Hs.161623 ESTs 7.3 

128493 D87466 Hs.240112 KIAA0276 protein 3.1 

128493 D87466 Hs.240112 KIAA0276 protein 1.3 

rj 128522 BE173977 Hs.10098 putative nucleolar RNA helicase 9.4 

f==i 128527 AA504583 Hs.101047 transcription factor 3 (E2A immunoglobulin enhancer binding factors E12/E47) 1.5 

72.0 128528 R39234 Hs.251699 ESTs, Weakly similar to IDN4-GGTR1 4 [H.sapiens] 2.8 

Ui 128595 U31875 Hs.272499 short-chain alcohol dehydrogenase family member 12.1 

p;i 128599 NM_015366Hs.102336 Rho GTPase activating protein 8 2.3 

= 128604 AI879099 Hs.102397 GIOT-3 for gonadotropin inducible transcription repressor-3 1.3 

,y 128608 BE267994 Hs.102419 zinc finger protein 7.1 

S25 128625 AB037841 Hs.102652 hypothetical protein ASH1 1.3 

ri 128629 AL096748 Hs.102708 DKFZP434A043 protein 3.2 

y 128639 AW582962 Hs.102897 CGI-47 protein 2.0 

\_ 128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilon 1.4 

Ci 128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilo 1.3 

L,30 128658 BE397354 Hs.324830 diptheria toxin resistance protein required for diphthamide biosynthesis (Saccharomyces)-like 2 2.4 
~ I 128670 AA975486 Hs.103441 Homo sapiens, Similar to RIKEN cDNA 170001 0L1 9 gene, clone MGC:16214, mRNA, complete cds 7.1 

?V 128691 W27939 Hs.103834 hypothetical protein MGC5576 7.7 

_E; 128696 BE081143 Hs.225977 nuclear receptor coactivator 3 3.8 

== ; 128700 Y15221 Hs.103982 small inducible cytokine subfamily B (Cys-X-Cys), member 1 1 1.6 

:35 128714 T85231 Hs.179661 tubulin, beta 5 7.6 

W 128717 AK001564 Hs.104222 hypothetical protein FLJ10702 5.5 

128733 BE147740 Hs.104558 ESTs, Moderately similar to I38022 hypothetical protein [H.sapiens] 2.7 

128737 AF292100 Hs.104613 RP42 homolog 2.8 

128742 AA307211 Hs.251531 proteasome (prosome, macropain) subunit, alpha type, 4 4.4 

40 128746 AI470163 Hs.323342 actin related protein 2/3 complex, subunit 4 (20 kD) 2.2 

128747 AB027249 Hs.104741 PDZ-binding kinase; T-cell originated protein kinase 2.8 

128772 BE302796 Hs.105097 thymidine kinase 1, soluble 5.3 

128781 N71826 Hs.105465 small nuclear ribonucleoprotein polypeptide F 53.9 

128797 NM_002975Hs.105927 stem cell growth factor; lymphocyte secreted C-type lectin 13.3 

45 128806 AW630942 Hs.106061 RDRNA-binding protein 2.6 

128814 AW248431 Hs.256526 nuclear prelamin A recognition factor 2.2 

128830 BE281170 Hs.106357 valosin-containing protein 5.9 

128835 AK001731 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (from clone DKFZp586H0924) 1.6 

128854 BE159181 Hs.168232 hypothetical protein FLJ13855 2.2 

50 128854 BE159181 Hs.168232 hypothetical protein FLJ 13855 1.9 

128868 AA419008 Hs.1 06730 chromosome 22 open reading frame 3 3.0 

128868 AA419008 Hs.106730 chromosome 22 open reading frame 3 2.2 

128871 AF189723 Hs.106778 ATPase, Ca++ transporting, type 2C, member 1 1.5 

128891 F34856 Hs.292457 Homo sapiens, clone MGC:16362, mRNA, complete cds 13.3 

55 128906 R57988 Hs.10706 epithelial protein lost in neoplasm beta 4.7 

128920 AA622037 Hs.166468 programmed cell death 5 1.4 

128925 R67419 Hs.21851 Homo sapiens CDNAFLJ12900 fis, clone NT2RP2004321 1.9 

128946 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine 3-hydroxylase) • 7.2 

128949 AA009647 Hs.8850 a disintegrin and metalloproteinase domain 12 (meltrin alpha) (ADAM-12) 2.4 

60 128958 AW139032 Hs.107376 hypothetical protein DKFZp434N035 1.3 

128959 AI580127 Hs.107381 hypothetical protein FLJ1 1200 10.9 

128965 AW150697 Hs.107418 ESTs 1.4 

128970 AI375672 Hs.165028 ESTs 1.3 

128975 BE560779 Hs.284233 NICE-5 protein 14.0 

65 128979 AW271217 Hs.281434 Homo sapiens cDNAFLJ14028 fis, clone HEMBA1003838 1.6 

128995 AI816224 Hs.107747 DKFZP566C243 protein 1.9 

129019 AI950087 gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapiens cDNA clone 3', mRNA sequence 2.9 
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129021 AL044675 Hs. 173081 KIAA0530 protein 3.8 

129021 AL044675 Hs. 173081 KIAA0530 protein 2.5 

129032 R80088 Hs.108104 ubiquitin-conjugating enzyme E2L 3 3.4 

129076 AW296806 Hs.326234 ESTs, Highly similar to T46422 hypothetical protein DKFZp434M2023.1 [H.sapiens] 5.0 

5 129078 AI351010 Hs.102267 lysosomal 2.1 

129088 AA744610 Hs.194431 palladin 17.1 

129095 L12350 Hs.108623 thrombospondin 2 2.7 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9 

129097 BE243933 Hs.1 08642 zinc finger protein 22 (KOX 15) 3.0 
10 129099 AF146074 Hs.108660 ATP-binding cassette, sub-family C (CFTR/MRP), member 5 5.8 

129136 W93048 Hs.250723 hypothetical protein MGC2747 5.9 

129149 AA356620 Hs.108947 KIAA0050 gene product 6.3 

129172 AW162916 Hs.241576 hypothetical protein PR02577 1.8 

129192 AA286914 Hs.183299 ESTs 2.1 

15 129194 AA150797 Hs.109276 latexin protein 3.2 

H> 129198 N57532 Hs.109315 KIAA1415 protein 5.8 

~: 129207 AI934365 Hs.109439 osteoglycin (osteoinductive factor, mimecan) 8.0 

S 129228 U40714 Hs.239307 tyrosyl-tRNA synthetase 2.9 

W : 129229 AF013758 Hs.109643 polyadenylate binding protein-interacting protein 1 3.2 

[T20 129254 AA252468 Hs.1098 DKFZp434J1813 protein 2.6 

ff\ 129255 AI961727 Hs.1 09804 H1 histone family, member X 7.3 

■J' 129288 W26392 Hs.1 10080 ESTs, Weakly similar to S13495 pregnancy zone protein rH.sapiens] 9.6 

RJ 129296 AI051967 Hs.1 10122 ESTs 1.2 

%A 129323 AA287239 Hs.5518 Homo sapiens cDNA FLJ1 1311 fis, clone PLACE1010102 5.1 

r :25 129340 H75334 Hs.1 1050 F-box only protein 9 4.6 

129347 BE614192 Hs.279869 melanoma-associated antigen recognised by cytotoxic T lymphocytes 7.6 

s 129362 U30246 Hs.110736 solute carrier family 12 (sodium/potassium/chloride transporters), member2 6.7 

pi 129366 BE220806 Hs.184697 Homo sapiens clone 23785 mRNA sequence 8.6 

r 129370 AI686379 Hs.1 10796 SAR1 protein 1.4 

H30 129372 NM_016039Hs.1 10803 CGI-99 protein 2.0 

fli 129403 AF149785 Hs.1 11 126 pituitary tumor-transforming 1 interacting protein 7.4 

V 129404 AI267700 Hs.317584 ESTs 5.0 

T 129404 AI267700 Hs.317584 ESTs 2.5 

Q 129423 AA204686 Hs.234149 hypothetical protein FLJ20647 10.2 

ri35 129449 AI096988 Hs.1 11 554 ADP-ribosylation factor-like 7 8.0 

129453 AW974265 Hs.111632 Lsm3 protein 3.2 

129482 AA188185 Hs.289043 spindlin 6.7 

129482 M188185 Hs.289043 spindlin 3.6 

129513 AW843633 Hs.306163 hypothetical protein AL1 101 15 7.1 

40 129515 AF255303 Hs.112227 membrane-associated nucleic acid binding protein 2.5 

129527 M769221 Hs.270847 delta-tubulin 3.2 

129559 W01296 Hs.11360 hypothetical protein FLJ 14784 7.5 

129560 AA317841 Hs.7845 hypothetical protein MGC2752 6.8 
129570 AI923097 Hs.1 1441 chromosome 1 open reading frame 8 2.0 

45 129575 F08282 Hs.278428 progestin induced protein 1.6 

129587 H14718 Hs.11506 Human clone 23589 mRNA sequence 6.8 

129588 BE408300 Hs.301862 postmeiotic segregation increased 2-like 9 1.4 
129591 N57423 Hs.179898 HSPC055 protein 7.3 
129594 AW403724 Hs.36989 coagulation factor VII (serum prothrombin conversion accelerator) 9.0 

50 129596 AF035537 Hs.115521 REV3 (yeast homolog)-like, catalytic subunit of DNA polymerase zeta 1.6 

129628 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (melanoma, p16, inhibits CDK4) 2.2 

129628 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 1.4 

129629 AK000398 Hs.11747 hypothetical protein FU20391 3.8 
129649 AD000092 Hs.16488 calreticulin 3.3 

55 129675 NM_015556Hs.172180 KIAA0440 protein 13.4 

129680 U03749 gb:Human chromogranin A (CHGA) gene, promoter an 14.1 

129689 AW748482 Hs.77873 B7homolog3 2.6 

129702 AI304966 Hs.12035 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 7.4 

129720 AA156214 Hs.12152 APMCF1 protein 2.0 

60 129721 NM_001415Hs.211539 eukaryotic translation initiation factor 2, subunit 3 (gamma, 52kD) 1.7 

129726 H15474 Hs.132898 fatty acid desatu rase 1 8.3 

129778 AK001676 Hs.12457 hypothetical protein FLJ 10814 1.8 

129779 AA394090 Hs.12460 Homo sapiens clone 23870 mRNA sequence 5.4 
129800 AF052112 Hs.12540 lysosomal 1.7 

65 129806 AB023148 Hs.173373 KIAA0931 protein 1.2 

129815 BE565817 Hs.26498 hypothetical protein FLJ21657 3.1 

129840 NM_006590Hs. 12820 SnRNP assembly defective 1 homolog 1.8 
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130675 AA442233 Hs. 17731 hypothetical protein FLJ1 2892 5 4 

130692 AA652501 Hs.13561 hypothetical protein MGC4692 5.0 

130693 R68537 Hs. 17962 ESTs 2.0 
130712 AJ271881 Hs.279762 bromodomain-containing 7 1.8 

5 130714 AI348274 Hs.18212 DNA segment on chromosome X (unique) 9879 expressed sequence 2.0 

130730 AB007920 Hs. 18586 KIM0451 gene product 3.7 

130744 H59696 Hs.18747 POP7 (processing of precursor, S. cerevisiae) homolog 3.1 

130751 AF052105 Hs. 18879 chromosome 12 open reading frame 1.4 

130757 AL036067 Hs.18925 protein x 0001 5.7 

10 130768 AF258627 Hs.211562 ATP-binding cassette, sub-family A (ABC1), member 1 5.1 

130789 AK000355 Hs.8899 sirtuin (silent mating type information regulation 2, S.cerevisiae, homolog) 5 5.2 

130815 AB018298 Hs.19822 SEC24 (S. cerevisiae) related gene family, member D 1.5 

130836 J05068 Hs.2012 transcobalamin I (vitamin B12 binding protein, R binder family) 15.7 

130841 AL157468 Hs.325825 Homo sapiens cDNA FLJ20848 fis, clone ADKA01732 2.8 

15 130843 AA447492 Hs.20183 ESTs, Weakly similar to AF164793 1 protein x 013 [H.sapiens] 1.5 

130844 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 3.4 

130855 AJ243706 Hs.143323 putative DNA/chromatin binding motif 1.7 

M 130861 NM_016578Hs.20509 HBV pX associated protein-8 1.9 

f=i 130879 NM_003416Hs.2076 zinc finger protein 7 (KOX 4, clone HF. 16) 1.4 

5-520 130880 BE514434 Hs.20830 kinesin-like 2 2.1 

^ 130892 AL120837 Hs.20993 high-glucose-regulated protein 8 2.4 

Ul 130898 AB033078 Hs.186613 sphingosine-1 -phosphate lyase 1 1.7 

fr: 130911 BE409769 Hs.21189 DnaJ (Hsp40) homolog, subfamily A, member 2 1.8 

zs 130919 N79110 Hs.21276 collagen, type IV, alpha 3 (Goodpasture antigen) binding protein 2.3 

ii25 130944 BE382657 Hs.21486 signal transducer and activator of transcription 1,91 kD 5.4 

SI 130971 N39842 Hs.301444 KIAA1673 2.2 

!■<=■. 130992 BE398091 Hs.74316 desmopiakin (DPI, DPII) 18 

130993 T97401 Hs.21929 ESTs 1.6 

» 131005 AV658308 Hs.2210 thyroid hormone receptor interactor 3 1.6 

i:*30 131028 AI879165 Hs.2227 CCAAT/enhancer binding protein (C/EBP), gamma 1.2 

Lj. 131042 AI826288 Hs.171637 hypothetical protein MGC2628 1.6 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cys-X-Cys), member 10 7.4 

W 131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 3.0 

JS 131047 H23230 Hs.22481 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 1.7 

J~35 131060 AA194422 Hs.22564 myosin VI 5.1 

^ 131060 AA194422 Hs.22564 myosin VI 2.5 

fU 131070 N53344 Hs.22607 ESTs 7.1 

131076 AA749230 Hs.26433 dolichyl-phosphate (UDP-N-acetylglucosamine) N-acetylglucosaminephosphotransferase 1 ( 2.0 

131076 AA749230 Hs.26433 dolichyl-phosphate (UDP-N-acetylglucosam 1.9 

40 131099 AL133353 Hs.226581 COX15 (yeast) homotog, cytochrome c oxidase assembly protein 7.0 

131174 NM_006540Hs.29131 nuclear receptor coactivator 2 19 

131185 BE280074 Hs.23960 cyclin B1 5 8 

131206 AW138839 Hs.24210 ESTs 2 0 

131213 AA885699 Hs.24332 CGI-26 protein 7.0 

45 131225 H62087 Hs.31659 thyroid hormone receptor-associated protein, 95-kD subunit 7.5 

131231 N47468 Hs.59757 zinc finger protein 281 2.9 

131233 D89053 Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 3 3.5 

131243 AW383256 Hs.24752 spectrin SH3 domain binding protein 1 2.8 

131245 AL080080 Hs.24766 thioredoxin domain-containing 2.8 

50 131247 AL043100 Hs.326190 fatty acid amide hydrolase 5.6 

131281 AA251716 Hs.25227 ESTs 5.7 

131283 X80038 Hs.339713 Homo sapiens clone F19374 APO E-C2 gene cluster 1.3 

131305 AV656017 Hs.184325 CGI-76 protein 5.0 

131320 AA505691 Hs.145696 splicing factor (CC1 .3) 1.8 

55 131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 2.6 

131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 2.6 

131375 AW293165 Hs.143134 ESTs 5.4 

131390 BE269388 Hs.182698 mitochondrial ribosomal protein L20 5.3 

131410 BE259110 Hs.279836 HSPC1 66 protein 2.2 

60 131412 NM_012247Hs.124027 SELENOPHOSPHATE SYNTHETASE; Human selenium donor protein 2.0 

131429 AL046302 Hs.26750 hypothetical protein FLJ21 908 1.4 

131458 BE297567 Hs.27047 hypothetical protein FLJ20392 1.7 

131475 AA992841 Hs.27263 KIAA1458 protein 2.0 

131501 AV661958 Hs.8207 GK001 protein 26 

65 131501 AV661958 Hs.8207 GK001 protein 1.6 

131511 AA732153 Hs.27865 Homo sapiens cDNA: FU21333 fis, clone COL02535 2.0 

131528 AU076408 Hs.28309 UDP-glucose dehydrogenase 1.6 
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131532 BE268278 Hs.28393 hypothetical protein MGC2592 7.4 

131543 AW966881 Hs.41639 programmed cell death 2 2.2 

131544 AL355715 Hs.28555 programmed cell death 9 (PDCD9) 21 
131562 NM_003512Hs.28777 H2A histone family, member L 1.7 

5 131564 T93500 Hs.28792 Homo sapiens cONA FU11041 fis, clone PLACE1004405 5.1 
131564 T93500 Hs.28792 Homo sapiens cDNA FU 11 041 fis, clone PL 1.8 
131569 AL389951 Hs.271623 nucleoporin 50kD 5.0 
131618 BE393822 Hs.29645 Homo sapiens mRNA; cDNA DKFZp761 C029 (from clone DKFZp761 C029); partial cds 1 .8 

131622 R78195 Hs.29692 Homo sapiens cDNA FLJ1 1436 fis, clone HEMBA1001213 1.3 
10 131623 AB037791 Hs.29716 hypothetical protein FU 10980 2 2 

131623 AB037791 Hs.29716 hypothetical protein FLJ 10980 1.9 
131643 AW410601 Hs.30026 HSPC1 82 protein 2.9 
131653 AW960597 Hs.30164 ESTs 1 3 
131656 AI218918 Hs.30209 KIAA0854 protein 2.8 

15 131669 X52486 Hs.3041 uracil-DNAglycosylase2 2.8 

131692 BE559681 Hs.30736 KIAA0 124 protein 5.6 

131714 AA642831 Hs.31016 putative DNA binding protein 2.9 

M: 131722 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransferase 3.4 
r% n 131737 AK001641 Hs.31323 inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase complex-associated protein 3.8 

3.0 131760 X76732 Hs.3164 nucleobindin 2 2.9 

131760 X76732 Hs.3164 nucleobindin 2 2.8 

If; 131763 AI878932 Hs.317 topoisomerase (DNA) I 3.4 

*?• 131772 AA382590 Hs.170980 KIAA0948 protein 25.5 

IT! 131774 BE267158 Hs.169474 DKFZP586J01 19 protein 5.5 

!l25 131787 D87077 Hs.196275 KIAA0240 protein 2.4 

%A 131793 AW966127 Hs.32246 Homo sapiens CDNAFLJ14656 fis, clone NT2RP2002439 7.9 

p[ 131795 BE501849 Hs.32317 high-mobility group 20B 1.4 

131798 X86098 Hs.301449 adenovirus 5 E1A binding protein 4.1 

= 131817 U20536 Hs.3280 caspase 6, apoptosis-related cysteine protease 4.2 

f 30 131824 U28838 Hs.32935 TATA box binding protein (TBP)-associated factor, RNA polymerase III, GTF3B subunit 2 3.5 

n 131850 AI251317 Hs.33184 ESTs 5.1 

fj" 131878 AA083764 Hs.6101 hypothetical protein MGC3 178 5.8 

fli 131885 BE502341 Hs.3402 ESTs 13.7 

jr 131885 BE502341 Hs.3402 ESTs 2.4 

235 131887 W17064 Hs.332848 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e, member 1 3.2 

W 131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, complete cds 8.7 

HI 131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, com 2.0 

131904 AF078866 Hs.284296 Homo sapiens cDNA: FLJ22993 fis, clone KAT1 191 4 5.5 

131905 AA179298 Hs.3439 stomatin-like 2 11.3 
40 131913 AW207440 Hs.185973 degenerative spermatocyte (homolog Drosophila; lipid desaturase) 1.7 

131916 M025976 Hs.34569 ESTs 5.2 

131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (yeastAPCH homolog) 2.7 

131929 BE541211 Hs.34804 HomosapienscDNAFU11472fis,cloneHEMBA1001711 5.3 

131941 BE252983 Hs.35086 ubiquitin specific protease 1 2.3 

45 131950 M355113 Hs.35380 x 001 protein 1.5 

131962 AK000046 Hs.267448 hypothetical protein FU20039 2.3 

131965 W79283 Hs.35962 ESTs 1.4 

131971 BE567100 Hs.154938 hypothetical protein MDS025 3.5 
131977 U90441 Hs.3622 procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide II 6.5 

50 131985 AA503020 Hs.36563 hypothetical protein FLJ2241 8 2.4 

131991 AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 (yeast homolog), beta 2.1 

132019 H56995 Hs.37372 Homo sapiens DNA binding peptide mRNA, partial cds 3.2 

132031 AF1 93844 Hs.3758 COP9 complex subunit 7a 5.8 

132062 BE266155 Hs.3832 clathrin-associated protein AP47 1.5 

55 132084 NM_002267Hs.3886 karyopherin alpha 3 (importin alpha 4) 3.7 

132103 BE171921 Hs.3991 ESTs 1.4 

132105 AV646076 Hs.39959 ESTs 5.8 

132116 AW960474 Hs.40289 ESTs 17 

132176 M857025 Hs.8878 kinesin-like 1 3.3 

60 132180 NM_004460Hs.418 fibroblast activation protein, alpha 14.7 

132192 AA206153 Hs.4209 mitochondrial ribosomal protein L37 5.5 

132194 R42432 Hs.4212 ESTs 4.4 

132203 NM_004782Hs.194714 synaptosomal-associated protein, 29kD 2.2 

132207 BE206939 Hs.42287 E2F transcription factor 6 2.2 

65 132235 AV658411 Hs.42656 KIAA1681 protein 7.8 

132240 AB018324 Hs.42676 KIAA0781 protein 1.5 

132252 AI566004 Hs.141269 Homo sapiens cDNA: FLJ21550 fis, clone COL06258 1.3 
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132266 AA301228 Hs.43299 hypothetical protein FLJ 12890 57 

132273 AA227710 Hs.43658 DKFZP586L1 51 protein 42 

132276 M653507 Hs.285711 hypothetical protein FLJ13089 2.1 

132288 N36110 Hs.305971 solute carrier family 2 (facilitated glucose transporter), member 10 1.5 

5 132294 AB023191 Hs.44131 KIAA0974 protein 10.0 

132298 NM_015986Hs.7120 cytokine receptor-like molecule 9 19 

132299 AW405882 Hs.44205 cortistatjn 9.2 
132325 N37065 Hs.44856 hypothetical protein FU1 21 16 2.0 
132348 AW067708 Hs.170311 heterogeneous nuclear ribonucleoprotein D-like 6.5 

10 132370 AW572805 Hs.46645 ESTs 3.8 

132374 AF155582 Hs.46744 corel UDP-galactose:N-acetylgalactosamine-alpha-R beta 1,3-galactosyltransferase 1.5 

132376 AI279892 Hs.46801 sorting nexin 14 12 5 

132384 AA312135 Hs.46967 HSPC034 protein 28.3 

132393 AL135094 Hs.47334 hypothetical protein FLJ14495 1.9 

15 132450 AA100012 Hs.48827 hypothetical protein FU12085 1.9 

132452 AW973521 Hs.247324 mitochondrial ribosomal protein S14 6.1 

132456 AB011084 Hs.48924 KIAA05 12 gene product ALEX2 1.7 

== ; 132465 AW169847 Hs.49169 KIAA1634 protein 86 

Q 132470 AI224456 Hs.4934 H.sapiens polyA site DNA 5.2 

?=20 132484 X16660 Hs.119007 RAB4, member RAS oncogene family 1.4 

Ti\ 132518 AW885606 Hs.5064 ESTs 6.1 

Ul 132528 T78736 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeast)-like 1 3.3 

tti 132530 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. cerevisiae)-like 1 2.0 

132532 AA454132 Hs.5080 mitochondrial ribosomal protein L16 2.9 

: %5 132534 BE388673 Hs.5086 hypothetical protein MGC1 0433 2.2 

Si 132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 7.3 

r ! j 132571 AW674699 Hs.5169 suppressor of G2 allele of SKP1, S. cerevisiae, homolog of 1.7 

132574 AW631437 Hs.5184 TH1 drosophila homolog 71 

* 132596 AK001484 Hs.5298 CGI-45 protein " 2.2 

C30 132611 AA345547 Hs.53263 hypothetical protein FU 13287 2.2 

Ls. 132612 H12751 Hs.5327 PR01914 protein 6.8 

132616 BE262677 Hs.283558 hypothetical protein PR01855 14.0 

\'y 132638 AI796870 Hs.54277 DNA segment on chromosome X (unique) 9928 expressed sequence 11.4 

Jg: 132648 U51127 Hs.54434 hypothetical protein MGC1 715 1.9 

===35 132668 AB018319 Hs.5460 KIAA0776 protein 2.6 

If ; 132692 AW191962 Hs.249239 collagen, type VIII, alpha 2 2.0 

132715 F11875 Hs.5534 Homo sapiens cDNA FLJ 1 2961 fis, clone NT2RP2005645 1.5 

132718 NM_004600Hs.554 Sjogren syndrome antigen A2 (60kD, ribonucleoprotein autoantigen SS-A/Ro) 3.0 

132724 AI142265 Hs.55498 geranylgeranyl diphosphate synthase 1 " 2.4 

40 132731 AI189075 Hs.301872 hypothetical protein MGC4840 12.4 

132744 AA010233 Hs.55921 glutamyl-prolyl-tRNA synthetase 14.6 

132760 M125985 Hs.56145 thymosin, beta, identified in neuroblastoma cells 2.7 

132771 Y10275 Hs.56407 phosphoserine phosphatase 3.0 

132773 AA459713 Hs.295901 KIAA0493 protein 2.3 

45 132784 AI142133 Hs.56845 GDP dissociation inhibitor 2 1.8 

132798 AI026701 Hs.5716 KIAA0310 gene product 3.7 

132807 U07418 Hs.57301 mutl (E. coli) homolog 1 (colon cancer, nonpolyposis type 2) 1 .8 

132810 AB007944 Hs.5737 KIAA0475 gene product 5.9 

132813 BE31 3625 Hs.57435 solute carrier family 11 (proton-coupled divalent metal ion transporters), member 2 8.7 

50 132815 AI815189 Hs.57475 sex comb on midleg homolog 1 6.4 

132817 N27852 Hs.57553 tousled-like kinase 2 3.6 

132821 AJ251595 Hs.169610 CD44 antigen (homing function and Indian blood group system) 2.8 

132833 U78525 Hs.57783 eulraryotic translation initiation factor 3, subunit 9 (eta, 116kD) 14.6 

132842 NM_016154Hs.279771 Homo sapiens clone PP1596 unknown mRNA 1.6 

55 132844 F12200 Hs.5811 chromosome 21 open reading frame 59 2.5 

132851 U0971 6 Hs.287912 lectin, mannose-binding, 1 1.4 

132863 BE268048 Hs.236494 RAB10, member RAS oncogene family 4.2 

132869 AW963217 Hs.203961 ESTs, Moderately similarto AF1 16721 89 PR02168 [H.sapiens] 2.8 

132873 AW007683 Hs.58598 KIAA1266 protein 2.0 

60 132875 NM_004850Hs.58617 Rho-associated, coiled-coil containing protein kinase 2 1.6 

132891 BE267143 Hs.59271 U2(RNU2) small nuclear RNA auxiliary factor 1 (non-standard symbol) 1.4 

132897 AW503667 Hs.59545 ring finger protein 15 5.4 

132902 AI936442 Hs.59838 hypothetical protein FLJ10808 6.1 

132912 AW732760 Hs. 167578 Homo sapiens cDNA FU1 1095 fis, clone PLACE1005374 7.1 

65 132913 W78714 Hs.60257 Homo sapiens CDNAFU13598 fis, clone PLACE1009921 2.8 

132940 T79136 Hs.127243 Homo sapiens mRNA for KIAA1724 protein, partial cds 6.1 

132941 AI817165 Hs.6120 hypothetical protein FLJ 13222 10.3 
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132942 M554458 Hs.197751 KIAA0666 protein 1.8 
132952 AI658580 Hs.61426 Homo sapiens mesenchymal stem cell protein DSC96 mRNA, partial cds 2 2 

132962 M576635 Hs.6153 CGI-48 protein 4 9 

132972 AA034365 Hs.288924 Homo sapiens cDNA FLJ11392 fis, clone HEMBA1 000575 27 
5 132973 AA035446 Hs.323277 ESTs 5 3 

132977 AA093322 Hs.301404 RNA binding motif protein 3 32 
132980 AA040696 Hs.62016 ESTs 13 
132994 AA1 12748 Hs.279905 clone HQ0310 PRO0310p1 30 
133012 M847843 Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA 10.3 
10 133015 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine.polypeptide N-acetylgalactosaminyltransferase 7 (GalNAc-T7) 2.1 
133016 AI439688 Hs.6289 hypothetical protein FLJ20886 " 1.3 

133053 AI065016 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, complete cds 6.0 
133062 AW500374 Hs.64056 PRO0149 protein 5.3 
133069 BE247441 Hs.6430 protein with polyglutamine repeat; calcium (ca2+) homeostasis endoplasmic reticulum protein 4.9 
15 133091 AK001628 Hs.64691 KIAA0483 protein , 35 

133110 AA808177 Hs.65228 ESTs 13.1 
133134 AF198620 Hs.65648 RNA binding motif protein 8A 1.3 

s _, 133145 H94227 Hs.6592 Homo sapiens, clone IMAGE:2961368, mRNA, partial cds 2.2 

133152 Z11695 Hs.324473 mitogen-activated protein kinase 1 1.3 
C20 133174 M431620 Hs.324178 hypothetical protein MGC2745 17.1 
p 5 ! 133175 AW955632 Hs.66666 ESTs, Weakly similar to S19560 proline-rich protein MP4 - mouse [M.musculus] 1.8 

7~, 133177 X97795 Hs.66718 RAD54 (S.cerevisiae)-like 4.9 

133197 AI275243 Hs.180201 hypothetical protein FLJ20671 31 

W 133208 AI801777 Hs.6774 ESTs 4.4 

fl25 133226 AW954569 Hs.296287 Homo sapiens, Similar to bromodomain-conlaining 4, clone IMAGE:3542455, mRNA, partial cds 1.7 

I", 133228 AI492924 Hs.6831 gofgi phosphoprotein 1 6.0 

133240 AK001489 Hs.242894 ADP-ribosylation factor-like 1 1.5 

O 133254 AI567421 Hs.273330 Homo sapiens, clone IMAGE:3544662, mRNA, partial cds 1.4 

133266 AH 60873 Hs.69233 zinc finger protein 5.6 

:30 133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORKHEAD BOX PROTEIN D2 [H.sapiens] 1.9 

M 133285 M76477 Hs.289082 GM2 ganglioside activator protein 4.7 

M= 133291 BE297855 Hs.69855 NRAS-related gene 5.0 

-hi 133314 AA102670 Hs.70725 gamma-aminobutyric acid (GABA) A receptor, pi 2.7 

!=■ 133321 T79526 Hs.179516 integral type I protein 9 3 

B p5 133327 AL390127 Hs.7104 Kruppel-like factor 13 4.4 

p? 133347 BE257758 Hs.71475 acid cluster protein 33 1.8 

pr| 133360 AI016521 Hs.71816 v-akt murine thymoma viral oncogene homolog 1 5.5 

\¥ 133366 AA292811 Hs.72050 non-metastatic cells 5, protein expressed in (nucleoside-diphosphate kinase) 2.7 

133367 AF231919 Hs.18759 K1AA0539 gene product 1.7 

40 133370 AF245505 Hs.72157 DKFZP564I1922 protein 18 

133383 BE313555 Hs.7252 KIAA1 224 protein 1.7 

133390 AI950382 Hs.72660 phosphatidylserine receptor 1.3 

133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB alpha polypeptide) 16.1 
133394 AA305127 Hs.237225 hypothetical protein HT023 12.2 

45 133437 AL031591 Hs.7370 phosphotidylinositol transfer protein, beta 10.4 

133452 NM_002759Hs.274382 protein kinase, interferon-inducible double stranded RNA dependent 1.2 

133453 AI659306 Hs.73826 protein tyrosine phosphatase, non-receptor type 4 (megakaryocyte) 1.7 
133500 AW964804 Hs.74280 hypothetical protein FLJ22237 11.1 
133529 W45623 Hs.74571 ADP-ribosylation factor 1 2.8 

50 133540 AL037159 Hs.74619 proteasome (prosome, macropain) 26S subunit, non-ATPase, 2 2.9 

133543 AU077073 Hs.108327 damage-specific DNA binding protein 1 (127kD) 2.5 

133578 AU077050 Hs.75066 translin 1.5 

133579 X75346 Hs.75074 mitogen-activated protein kinase-activated protein kinase 2 2.1 
133582 BE391579 Hs.75087 Fas-activated serine/threonine kinase 1.3 

55 133594 AW160781 Hs.172589 nuclear phosphoprotein similar to S. cerevisiae PWP1 2.2 

1 33595 AA393273 Hs.751 33 transcription factor 6-like 1 (mitochondrial transcription factor 1 -like) 1 .5 

133599 NM_002885Hs.75151 RAP1, GTPase activating protein 1 5.7 

133621 NM_004893Hs.75258 H2A histone family, member Y 25.5 

133627 NM_002047Hs.75280 glycyl-tRNA synthetase 15.8 

60 133631 NM_000401Hs.75334 exostoses (multiple) 2 3.3 

133649 U25849 Hs.75393 acid phosphatase 1, soluble 1.6 

133690 AV661185 Hs.75574 mitochondrial ribosomal protein L1 9 4.1 

133720 L27841 Hs.75737 pericentriolar material 1 1.5 

133722 AW969976 Hs.279009 matrix Gla protein 6.3 

65 133751 AW402048. Hs.334787 Homo sapiens, Similar to likely ortholog of yeast ARV1, clone IMAGE:3506392, mRNA 3.9 

133757 T52946 Hs.196209 RAE1 (RNA export 1 , S.pombe) homolog 1.7 

133760 BE271766 Hs.181357 laminin receptor 1 (67kD, ribosomal protein SA) 1.8 
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133765 M62194 Hs.75929 cadherin 11, type 2, OB-cadherin (osteoblast) 15 
133780 AA557660 Hs.76152 decorin 35 
133784 BE622743 Hs.301064 arfaptin 1 68 
133791 M34338 Hs.76244 spermidine synthase 2 6 

5 133797 AL133921 Hs.76272 retinoblastoma-binding protein 2 14 
133822 D50525 Hs.699 peptidylprolyl isomerase B (oyclophilin B) 80 
1 33842 AW797468 Hs.285013 putative human HLA class II associated protein I 13 5 

133845 AA1 47026 Hs.76704 ESTs 22 
133850 W29092 Hs.7678 cellular retinoic acid-binding protein 1 18 
10 133859 U86782 Hs.178761 26S proteasome-associated pad1 homolog 2 0 

133865 AB01 1 155 Hs.170290 discs, large (Drosophila) homolog 5 28 

133867 AW340125 Hs.76989 KIM0097 gene product 6 7 

133868 AB012193 Hs.183874 cullin 4A 2 5 
133881 U30872 Hs,77204 centromere protein F (350/400kD, mitosin) 30 

15 133922 U30825 Hs.77608 splicing factor, arginine/serine-rich 9 14 
133924 D86326 Hs.325948 vesicle docking protein p1 15 5.4 
133929 NM_006306Hs.211602 SMC1 (structural maintenance of chromosomes 1, yeast)-like 1 4 9 

i = 133936 L17128 Hs.77719 gamma-glutamyl carboxylase 37 

[" 133941 BE244332 Hs.77770 adaptor-related protein complex 3, mu 2 subunit 12.1 
1=20 133959 X81789 Hs.77897 splicing factor 3a, subunit 3, 60kD 97 
fp 133976 AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor gene activator) 3.1 

r=: 133989 AL040328 Hs.78202 SWI/SNF related, matrix associated, actin dependent regulator of chromatin 1.3 

133997 AI824113 Hs.78281 regulator of G-protein signalling 12 97 
W 134010 AB016092 Hs.197114 RNA binding protein; AT-rich element binding factor , 2.4 

r=25 134015 D31764 Hs.278569 sorting nexin 17 2 5 

: = : 134070 NM_003590Hs.78946 cullin 3 1.3 

%\ 134110 U41060 Hs.79136 UV-1 protein, estrogen regulated 4.2 

Q 134129 NM_014742Hs.79305 KIAA0255 gene product 2.2 

" . 134134 H86504 Hs.173328 protein phosphatase 2, regulatory subunit B (B56), epsilon isoform 5.0 

^30 134200 BE559598 Hs.197803 KIAA01 60 protein 3.2 

\J 134206 AF107463 Hs.79968 splicing factor 30, survival of motor neuron-related 2.5 

y, 134208 NM_000288Hs.79993 peroxisomal biogenesis factor 7 21 

= , 134219 NM_000402Hs.80206 glucose-6-phosphate dehydrogenase 9.1 

? 134234 BE300078 Hs.80449 Homo sapiens, clone IMAGE:3535294, mRNA, partial cds 2.8 

4=35 134275 AI878910 Hs.3688 cisptatin resistance-associated overexpressed protein 18 

p 134292 AI906291 Hs.81234 immunoglobulin superfamily, member 3 20 

If! 134301 AW502505 Hs.81360 Homo sapiens cDNA: FLJ21927 fis, clone HEP04178, highly similar to HSU90909 2.5 

ry 134305 U61397 Hs.81424 ubiquitin-like 1 (sentrin) 2 8 

134324 AB029023 Hs.179946 KIAA1 100 protein 10 4 

40 134326 AW903838 Hs.81800 chondroitin sulfate proteoglycan 2 (versican) 19 

134329 N92036. Hs.81848 RAD21 (S. pombe) homolog 2.6 

1 34337 NM_004922Hs.81964 SEC24 (S. cerevisiae) related gene family, member C 2.3 

134348 AW291946 Hs.82065 interleukin 6 signal transducer (gp130, oncostatin M receptor) 13.0 

134367 AA339449 Hs.82285 phosphoribosylgrycinamide formyltransferase, phosphoribosylglycinamide synthetase, 8.8 

45 134376 X06560 Hs.82396 2',5'-oligoadenylate synthetase 1 (40-46 kD) 1.5 

134379 AW362124 Hs.323193 hypothetical protein MGC3222 8.1 
1 34384 AI589941 Hs.8254 Homo sapiens, Similar to tumor differentially expressed 1 , clone IMAGE:3639252, mRNA, partial cds 2.6 

134391 AA417383 Hs.82582 integrin, beta-like 1 (with EGF-like repeat domains) 41 

134395 AA456539 Hs.8262 lysosomal 17 

50 134403 AA334551 Hs.82767 sperm specific antigen 2 26 

134405 AW067903 Hs.82772 collagen, type XI, alpha 1 1 3 

134411 BE272095 Hs.167791 reticulocalbin 1, EF-hand calcium binding domain 3.2 

134415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, member 2 19 

134421 AU077196 Hs.82985 collagen, type V, alpha 2 10 3 

55 134424 Z44190 Hs.83023 peroxisomal biogenesis factor 11B 2 4 

134446 M1 12036 Hs.83419 KIM0252 protein 1.2 

134447 M58603 Hs.83428 nuclear factor of kappa light polypeptide gene enhancer in B-celis 1 (p1 05) 1 .6 
134470 X54942 Hs.83758 CDC28 protein kinase 2 " 2.1 
134480 NM_005000Hs.83916 Empirically selected from AFFX single probeset 5 3 

60 134485 X82153 Hs.83942 cathepsin K (pycnodysostosis) 2 5 

134498 AW246273 Hs.84131 threonyl-tRNA synthetase 21 

134513 AA425473 Hs.84429 KIAA0971 protein 38 

134516 AK001571 Hs.273357 hypothetical protein FU 10709 2.4 

134520 BE091005 Hs.74861 activated RNA polymerase II transcription cofactor 4 67 

65 134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2 3 

134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor for tRNAs) 55 

134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 5.8 

135 



Attorney Docket No.:01 850 1 -0052 1 OUS 



134612 AW068223 Hs.171581 ubiquitin C-terminal hydrolase UCH37 2 2 

134624 AF035119 Hs.8700 deleted in liver cancer 1 2 0 

134632 X78520 Hs.174139 chloride channel 3 2 3 

134654 AK001741 Hs.8739 hypothetical protein FLJ10879 14 

5 134664 AA256106 Hs.87507 ESTs 72 9 

134666 BE391929 Hs.8752 transmembrane protein 4 85 

134687 U62317 Hs.88251 arylsulfatase A 6.0 

134692 NM_003474Hs.8850 a disintegrin and metalloproteinase domain 12 (meltrin alpha) 4.3 

134705 BE161887 Hs.88799 anaphase-promoting complex subunit 10 2 3 

10 134714 Y14768 Hs.890 lysosomal 6 7 

134719 AA852985 Hs.89232 chromobox homolog 5 (Drosophila HP1 alpha) 23 

134722 AF129536 Hs.284226 F-box only protein 6 29 

134724 AF045239 Hs.321576 ring finger protein 22 6 6 

134746 X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell receptor 2 3 

15 134751 AW630803 Hs.89497 laminBI 62 

134790 BE002798 Hs.287850 integral membrane protein 1 19 

134806 AD001528 Hs.89718 spermine synthase 18 

134834 AW451 370 Hs.8991 adaptor-related protein complex 1, gamma 2 subunit 14 

! 134850 AI701162 Hs.90207 hypothetical protein MGC1 11 38 1.4 

4=20 134853 BE268326 Hs.90280 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase/llvlP cyclohydrolase 5 6 

O 134859 D26488 Hs.90315 KIM0007 protein 28 

If! 134880 AI879195 Hs.90606 15 kDa selenoprotein 17 

i! 134910 AA532963 Hs.9100 Homo sapiens cDNAFLJ13100fis, clone NT2RP3002255 1 7 

W*' 134925 AW885909 Hs.6975 PRO1073 protein 2.1 

n.25 134955 AW401361 Hs.91773 protein phosphatase 2 (formerly 2A), catalytic subunit, alpha isoform 13 

l r _'\ 134971 AI097346 Hs.286049 phosphoserine aminotransferase 21 

Jr 134975 R50333 Hs.92186 Leman coiled-coil protein 2 3 

O 135011 AB037835 Hs.92991 KIAA1414 protein 16 

135022 NM_O00408Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mitochondrial) 39 

, -30 135032 AW301984 Hs.173685 hypothetjcal protein FLJ12619 6 2 

= : 135077 AW503733 Hs.9414 KIAA1488 protein 20 

H' 135083 AB036063 Hs.94262 p53-inducible ribonucleotide reductase small subunit 2 homolog 1.3 

p] 135095 AF027219 Hs.9443 zinc finger protein 202 71 

135096 M081258 Hs.132390 zinc finger protein 36 (KOX 18) 3 2 

#35 135153 AI093155 Hs.95420 JM27 protein 2 5 

C ! ! 135181 BE250865 Hs.279529 px19-like protein 14 

si 135199 AA477514 Hs.96247 translin-associated factor X 5.0 

= 135207 N26427 Hs.9634 ESTs, Highly similar to C10JHUMAN PUTATIVE C10 PROTEIN [H.sapiens] 61 

135214 T78802 Hs.96560 hypothetjcal protein FU 11 656 4 6 

40 135243 BE463721 Hs.97101 putative G protein-coupled receptor 5 6 

135245 AI028767 Hs.262603 ESTs 35 

135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens] 12 

135263 AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1 * 2 6 

135274 AA448460 Hs.112017 GE36gene 53 

45 135294 AA1 50320 Hs.9800 protein kinase Njmu-R1 91 

135295 AI090838 Hs.98006 ESTs 24 

135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein [H.sapiens] 13.3 

135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD homolog) 2 6 

135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE proteins) 8 3 

50 135361 AA373452 Hs.167700 Homo sapiens cDNAFLJ10174fis, clone HEMBA1003959 1.5 

135389 U05237 Hs.99872 fetal Alzheimer antigen 4.9 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone receptor; testicular feminization; spinal and bulbar 2.0 

134975 R50333 Hs.92186 Leman coiled-coil protein 2 6 

135011 AB037835 Hs.92991 KIAA1414 protein 14 

55 135022 NM_000408Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mi 16 

135032 AW301984 Hs.173685 hypothetical protein FU12619 14 

135077 AW503733 Hs.9414 KIAA1 488 protein 18 

135083 AB036063 Hs.94262 p53-inducible ribonucleotide reductase s 2 5 

135095 AF027219 Hs.9443 zinc finger protein 202 15 

60 135096 AA081258 Hs.132390 zinc finger protein 36 (KOX 18) 21 

135153 AI093155 Hs.95420 JM27 protein 44 

135181 BE250865 Hs.279529 px19-like protein 14 9 

135199 AA477514 Hs.96247 translin-associated factor X 13 

135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATI 17 

65 135214 T78802 Hs.96560 hypothetical protein FLJ 11 656 6.1 

135243 BE463721 Hs.97101 putative G protein-coupled receptor 2 7 

135245 AI028767 Hs.262603 ESTs 12.2 
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135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked 7.6 

135263 AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1 1.8 

135274 AA448460 Hs.112017 GE36gene 4.1 

135294 AA1 50320 Hs.9800 protein kinase Njmu-R1 1.2 

5 135295 AI090838 Hs.98006 ESTs 4.8 

135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein 5.8 

135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho 12.3 

135354 M456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE pr 5.7 

135361 AA373452 Hs.167700 Homo sapiens cDNA FU10174fis, clone HE 7.9 

10 135389 U05237 Hs.99872 fetal Alzheimer antigen 1.9 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 13.9 

302256 AA857131 Hs.171595 HIV TAT specific factor 1 5.3 

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 2.2 

303135 AW592789 Hs.279474 HSPC070 protein 1.4 

15 303686 AK000714 Hs.109441 MSTP033 protein 5.2 

310085 R43191 Hs.101248 Homo sapiens clone IMAGE:32553, mRNA seq 2.3 

315518 AA808229 Hs.167771 ESTs 2.8 

y. 317781 NM_007057Hs.42650 ZW10 interactor 2.0 

L_ 320836 AI268997 Hs.197289 rab3 GTPase-activating protein, non-cata 5.5 

1=20 321114 AA902256 Hs.78979 Golgi apparatus protein 1 1.4 

fj 322221 N24236 Hs.179662 nucleosome assembly protein 1 -like 1 1.3 

f?; 322474 AF1 18083 Hs.29494 PR01912 protein 2.9 

"/ 322556 BE041451 Hs.177507 hypothetical protein 1.6 

Iff 323541 AF292100 Hs.104613 RP42homolog 1.8 

TiZ5 407827 BE278431 Hs.40323 BUB3 (budding uninhibited by benzimidazo 1.6 

I,"- 408196 AL034548 Hs.43627 SRY (sex determining region Y)-box 22 6.1 

408813 AI580090 Hs.48295 RNA helicase family 5.6 

p 409176 R73727 Hs.101617 ESTs, Weakly similar to T32527 hypotheti 2.6 

413670 AB000115 Hs.75470 hypothetical protein, expressed in osteo 2.4 

L30 414108 AI267592 Hs.75761 SFRS protein kinase 1 1.5 

W 414846 AW304454 Hs.77495 UBX domain-containing 1 4.2 

|== 416980 AA381133 Hs.80684 high-mobility group (nonhistone chramoso 23.6 

fl \ 417378 R57256 Hs.82037 TATA box binding protein (TBP)-associate 5.8 

'~ 418283 S79895 Hs.83942 cathepsin K (pycnodysostosis) 1.3 

435 418467 NM_006910Hs.85273 retinoblastoma-binding protein 6 1.6 

p 420269 U72937 Hs.96264 alpha thalassemia/mental retardation syn 2.3 

Sf; 420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 1.6 

; *' 421225 AA463798 Hs.102696 MCT-1 protein " 3.5 

421642 AF172066 Hs.106346 retinoic acid repressible protein 4.9 

40 421828 AW891965 Hs.279789 histone deacetylase 3 3.1 

421983 AI252640 Hs.1 10364 peptidylprolyl isomerase C (cyclophilin 1.9 

422052 AA302744 Hs.104518 ESTs 2.4 

422055 NM_014320Hs.111029 putative heme-binding protein 4.1 

423750 AF1 65883 Hs.298229 prefokfin2 7.0 

45 424001 W67883 Hs.137476 paternally expressed 10 (PEG10; KIAA105 4.9 

425182 AF041259 Hs.155040 zinc finger protein 217 3.4 

425284 AF155568 Hs.155489 NS1 -associated protein 1 2.1 

426372 BE304680 Hs.169531 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 7.5 

428049 AW183765 Hs.182238 GW128 protein 1.7 

50 428477 AW500533 Hs.1 1482 splicing factor, arginine/serine-rich 11 2.4 

437562 AB001636 Hs.5683 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 3.8 

438449 AK001333 Hs.6216 Homo sapiens hepatocellular carcinoma-as 5.6 

441560 F13386 Hs.7888 Homo sapiens clone 23736 mRNA sequence 2.0 

445580 AF167572 Hs.12912 skbl (S. pombe) homolog 7.5 

5 5 446999 M151520 Hs.334822 hypothetical protein MGC4485 2.2 

447111 AI017574 Hs.17409 cysteine-rich protein 1 (intestinal) 2.8 

447778 BE620592 Hs.71190 ESTs, Weakly similar to S 16506 hypotheti 1.7 

448873 NM_003677Hs.22393 density-regulated protein 5.9 

449687 W68520 Hs.331328 intermediate filament protein syncoilin 5.6 

60 450701 H39960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 1.4 

450703 AA011202 Hs.184771 nuclear factor l/C (CCAAT-binding transc 4.7 

452461 N78223 Hs.108106 transcription factor 2.9 

452511 BE408178 Hs.285165 Homo sapiens cDNA FLJ20845 fis, clone AD 12.1 

453157 AF077036 Hs.31989 DKFZP586G1 722 protein 4.7 

65 453658 BE541906 Hs.87819 Homo sapiens, clone MGC:2492, mRNA, comp 1.3 

100833 AF135168 Hs.108802 N-ethylmaleimide-sensitive factor 3.2 

102481 U50360 gb:Human calcium, calmodulin-dependent p 6.2 
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102827 BE244588 Hs.6456 chaperonin containing TCP1,subunit 2 (b 7 9 

103549 BE270465 Hs.78793 protein kinase C, zeta 20 

104331 AB040450 Hs.279862 cdk inhibitor p21 binding protein 5.3 

110018 AW579842 Hs.1 04557 hypothetical protein FU 10697 20 

115008 AK001827 Hs.87889 helicase-moi 57 

119075 M10905 Hs.287820 ftbronectin 1 1.3 

119615 AL034423 Hs.75875 ubiquitin-conjugating enzyme E2 variant 2.9 

125006 BE065136 Hs.145696 splicing factor (CC1. 3) 1.7 

127609 X80031 Hs.530 collagen, type IV, alpha 3 (Goodpasture 2.4 

129209 R62676 Hs.17820 Rho-associated, coiled-coil containing p 5.2 

129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 4.5 

130182 BE267033 Hs. 192853 ubiquitin-conjugating enzyme E2G 2 (homo 11.0 

130365 W56119 Hs.155103 eukaryotic translation initiation factor 3.3 

131135 NM_016569Hs.267182 TBX3-iso protein 1.3 

131853 AI681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU 3.2 

131881 AW361018 Hs.3383 upstream regulatory element binding prot 14.3 

132726 N52298 Hs.55608 hypothetical protein MGC955 3.0 

135193 X95525 Hs.96103 TATA box binding protein (TBP)-associate 2.7 

409487 H19886 gb:yn57a05.ii Soares adult brain N2b5H 2.3 

416040 AW8 191 58 Hs.289044 Homo sapiens cDNAFLJ 12048 fis, clone HE 7.4 
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TABLE 4A 

Table 4A shows the accession numbers for those pkeys lacking unigenelD's for Table 4. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



123615 30686_-15 AA609170 
123619 371 681 1 AA602964 AA609200 
101445 1650_-5 M21259 
124385 656394J AI267847 N27351 
124417 1642364J N34059 N46979 
124482 1657509_1 N53935N53950 
102481 31281_-28 U50360 
103349 11052_-2 X89059 

1 10856 1 9346J4 AA992380 N33063 N21418 H79958 R21 91 1 H79957 

103797 109699J AA080912 AA075318 M083403 AA076594 AA078992 AA084926 AA081881 AA1 13913 AA1 13892 
M083821 AA134801 M082953 AA070343 M062835 AA075419 AA063293 AA071252 AA078900 AA062836 AW974305 

120280 160212J AA190577 AA181657 

113248 328626J T63857 AW971220 AA493469T63699 

120472 44573_2 A1950087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
M908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 M884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 M488964 
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AI653832 
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 M642789 M856975 AW505512 AI961 530 AW629970 
BE612881 AW276997AW513601 AW512843 M044209 AW856538 AA1 80009 M337499AW961 101 AA251669 M251874 
AI81 9225 AW205862 AI683338 AI858509 AW276905 AI633006 M972584 AA908741 AW072629 AW51 3996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW61 3456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 

129019 44573_2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
M908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 M884922 BE328517 
AI219788 AA884444 N92578 F1 3493 M927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AI653832 
AI762688 AA988777 AA488892 AI356394 AW1 03813 AI539642 M642789 M856975 AW505512 AI961 530 AW629970 
BE612881 AW276997AW513601 AW512843 AA044209 AW856538M1 80009 AA337499 AW961101 AA251669 AA251874 
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 

120695 9683_3 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603 AW052210 
AA970201 A1633384 AA425910 AI017004 AI241295 AA402816 AA291468 

122188 275673J M398838 M435847 

121581 283769J AA416568 AA442889 AA417233AA442223 

122618 305217J AA453641 AA454061 

109026 150431J AA157811 AA836869 

123658 genbank_AA609364 AA609364 

123811 genbank_AA620586 M620586 

125115 genbank_T97341 T97341 

125147 NOT_FOUND_entrez_W38150 W38150 

118737 382979J AA1 99686 N73861 

120274 genbank_AA177051 M1 77051 

113196 genbank_T57317 T57317 

120504 genbank_AA256837 AA256837 

120639 genbank_AA286942 AA286942 
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120809 genbank_AA346495 AA346495 
113702 genbank_T97307 T97307 

129680 23162J U03749 NM_001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 M843875 AI829382 AI560122 
AI858999 D55958 AI684005 D53170 M854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441 
AW043898 AI969102 AA405741 AI091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432 
AI038109 M782478 AA910064 AI220384 M781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 AI584185 C05724 M789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 AA894441 
AI803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 M889304 AA976699 AI687001 
AI621107 AI865540 AA772107 C06286 AA319661 AA405992 

101045 entrez_J05614J05614 

117247 genbank N21032 

110501 genbank H55748 

103392 entrez_X94563X94563 

105032 genbank M127818 

119513 NOT_FOUND_entrez W37933 

105445 genbank AA252395 

121514 genbank AA412112 

121558 genbank AA412497 

121911 genbank AA427950 

123315 714071J AA496369 M496646 

114911 genbank AA236672 

409487 1134778.1 H19886AW402806T10231 
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TABLE 5: Figure 5 from BRCA 001 US 
Table 5 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnkjenelD: Unfcjene number 

10 Unfcjene Title: Unkjene gene title 

R1 : Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigenelD UnigeneTitle R1 

?f 100114 X02308 Hs.82962 thymidylate synthetase 2.9 

p 100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 7.5 

F>i 100154 H60720 Hs.81892 KIM01 01 gene product 9.2 

100335 AW247529 Hs.6793 platelet-aclivating factor acetylhydrola 2.7 

1120 100666 L05424 Hs.169610 CD44 antigen (homing function and Indian 5.7 

if) 100667 L05424 Hs.169610 CD44 antigen (homing function and Indian 9 

=f: 100668 L05424 Hs.169610 CD44 antigen (homing function and Indian 7.6 

— 100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 

SI 100988 AK000405 Hs.76480 ubiquitin-like 4 11.4 

p25 101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 8.2 

101045 J05614 gb:Human proliferating cell nuclear anti 5 

101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 3.4 

p 101352 AI494299 Hs.16297 COX17 (yeast) homolog, cytochrome coxid 6.3 

i, 101580 NM_012151Hs.83363 coagulation factor Vlll-associated (intr 5.7 

[30 101592 AF064853 Hs.91299 guanine nucleotide binding protein ( 5.6 

\U 101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 14.4 

J!: 101806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 8.9 

~. 101810 NM_000318Hs.180612 peroxisomal membrane protein 3 (35kD, Ze 3.2 

*= ; 101983 AI904232 Hs.75323 prohibitin 8.4 

?B5 102107 BE258602 Hs.182366 heat shock protein 75 1.4 

102165 BE313280 Hs.159627 death associated protein 3 4.6 

102198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regu 4.3 

102217 AA829978 Hs.301613 JTV1 gene 6.7 

102220 U24389 Hs.65436 lysosomal 4.3 

40 102302 AA306342 Hs.69171 protein kinase C-like 2 2.7 

102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 2 

102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2 

102455 U48705 Hs.75562 discoidin domain receptor family, member 6.9 

102568 W81489 Hs.223025 RAB31, member RAS oncogene family 5.3 

45 102618 AL037672 Hs.81071 extracellular matrix protein 1 5.8 

102687 NM_007019Hs.93002 ubiquitin carrier protein E2-C 4.3 

102689 U96132 Hs.171280 hydroxyacyl-Coenzyme A dehydrogenase, ty 6 

102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 1.9 

102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 2.3 
50 102801 BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin B6) kinas 6.4 

102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (b 5.6 

1 03060 NM_005940Hs. 1 55324 matrix metalloproteinase 11 (MMP11; stro 4.5 

103080 AU077231 Hs.82932 cyclin D1 (PRAD1: parathyroid adenomatos 3.1 

103178 AA205475 Hs.275865 ribosomal protein S1 8 9.9 

55 103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 

103238 AI369285 Hs.75189 death-associated protein 5.6 

103547 AI376722 Hs.1 80062 proteasome (prosome, macropain) subunit, 9.7 

103549 BE270465 Hs.78793 protein kinase C, zeta 7.9 

103886 AK001278 Hs.105737 hypothetical protein FLJ10416 similar to 6.5 

60 104325 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polyp 6.3 

104827 AW052006 Hs.8551 PRP4/STK/WD splicing factor 10.9 

104846 AI250789 Hs.32478 ESTs 5.6 

104854 AA041276 Hs.154729 3-phosphoinositide dependent protein kin 12.3 

104867 AA278898 Hs.225979 hypothetical protein similar to small G 2 
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104896 


AW015318 


Hs 23165 


ESTs 


177 


104909 


AW408164 


Hs.249184 


transrrintinn fartor 1 Q /sni 


K 


104916 


AW958157 


Hs. 155489 


NS1-associated protsin 1 


1.7 


104919 


AA026880 


Hs.25252 


nrolartin rpopnfor 


1 & 

1 .H 


104974 


Y12059 


H«s 77RRTi 


UiUII IVUUI lldllt-UJillclll liny H 


1 A. 


104978 


AM 99268 


Hs 19322 


Hnmri sanipns Similar tn RIKFN rDMA 9010 

I i\ft 1 Ik/ OaplCllo, Ollllllul IU r\li\Oi Ul— 'INrt £.\J IU 


7 2 


105012 


AF098158 


Hs 9329 


phromoQnmp 20 nnon nsaHinn frame* 1 

1*1 11UI 1 lUoUl 1 (C z\t upci 1 ICOUI) ly Hal 1 IC 1 


O.J 


105039 


AA907305 


Hs.36475 


ESTs 


2.5 


105079 


M151342 


Hs.12677 


CGI-147 protein 


9.5 


105088 


H58589 


Hs.35156 


Homo *5flnipn<? rDNA Fl 111027 fis rlnnp PI 

1 IVIIIUSCipidld uwiin 1 l—v 1 WJ&.I HO, V/IUIIC rL 


2.2 


105393 


AF1 67570 


Hs.256583 


intf*rlpukin pnhannpr hinriinn fartor 3 Q 

II ILCI IGUIvll 1 dlllQIIl^l UlllUIIIU IGk/lV/l Vj V 


5 4 


105547 


AA262640 


Hs.27445 


unknown 


9.3 


105564 


BE616694 


Hs.288042 


hypothetical protein FLJ14299 


1.4 


105658 


M985190 


Hs.246875 


hypothetical protein FU20059 


9.4 


105746 


AW151952 


Hs.46679 


hypothetical protein FLJ20739 


1.5 


105858 


AF151066 


Hs.281428 


hypothetical protein 


2.9 




AF016371 


Hs.9880 


peptidyl prolyl isomerase H (cyclophilin 


5.2 


106094 


AA533491 


Hs.23317 


hypothetical protein FLJ14681 


6.8 


106350 


AK001404 


Hs.194698 


cyclin B2 


5.7 




AW390282 


Hs.31130 


transmembrane 7 superfamily member 2 


6.3 


1UDD lu 


AA458882 


Hs.79732 


fibulin 1 


7.9 




NM_003595Hs.26350 


tyrosylprotein sulfotransferase 2 


7.7 


lUOf 10 


BE614802 


Hs.184352 


hypothetical protein FLJ 12549 


4.5 


i uoozy 


AW959893 


Hs.27099 


hypothetical protein FLJ23293 similar to 


16.2 


1 U0o4C 


AB037744 


Hs.34892 


KIAA1323 protein 


2.2 


lUOOf O 


N49809 


Hs.11197 


Homo sapiens, clone IMAGE:3343149, mRNA, 16.8 


iuoy/o 


BE1 56256 


Hs.11923 


hypothetical protein 


6.6 




AF264750 


Hs.288971 


myeloid/lymphoid or mixed-lineage leukem 


1.8 


1fY71Q7 


W15477 


Hs.64639 


glioma pathogenesis-related protein 


6.1 


1 H7ft^Q 

I U / OOcJ 


AW732573 


Hs.47584 


potassium voltage-gated channel, delayed 


8.4 


lUf UU 1 


L42612 


Hs.335952 


keratin 6B 


2.5 




BE153855 


Hs.61460 


Ig superfamily receptor LNIR 


2.2 




AW956103 


Hs.61712 


pyruvate dehydrogenase kinase, isoenzyme 


6.7 


1 (18 974 


AF1 29535 


Hs.272027 


F-box only protein 5 


7.1 


1fiftfiA7 
IUOO*f / 


BE546947 


Hs.44276 


homeo box C10 


9.8 


luooyo 


AB029000 


Hs.70823 


KIAA1077 protein 


7.2 




AK001431 


Hs.5105 


hypothetical protein FU10569 


4 


luyu 1 1 


M1 56542 


Hs.72127 


ESTs 


1.4 


i uyuoo 


AA1 64293 


Hs.72545 


ESTs 


2.9 


mQ97^ 


M375752 


Hs.82719 


Homo sapiens mRNA; cDNA DKFZp586F1822 (f 2.9 


iuy*too 


NM_015310Hs.6763 


KIAA0942 protein 


3.2 


1 109AA 


AI668594 


Hs.176588 


ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2 


1 luOOU 


AI288666 


Hs.16621 


DKFZP434I1 16 protein 


6.2 


I IUOU i 


H55748 




gb:yq94a01.s1 Soares fetal liver spleen 


6.1 


1 1fY7A9 


AW190338 Hs.28029 


hypothetical protein MGC11256 


7.6 


1 1fY7£9 


BE044245 


Hs.30011 


hypothetical protein MGC2963 


9.3 




AA992380 




gb:ot37g06.s1 Soares_testis_NHT Homo sap 2.3 


1 1 rVKft 


NM_005864Hs.24587 


signal transduction protein (SH3 contain 


6.7 


11119*\ 
I I I \/LD 


N63823 


Hs.269115 


ESTs, Moderately similar to Z1 95_HUMAN Z 3.6 


4 44 470 


AK000136 


Hs.10760 


asporin (LRR class 1) 


7.1 


Mi OIQ 

111 Zov 


N90956 


Hs.17230 


hypothetical protein FLJ22087 


7.9 


Mi OQC 

111 ^00 


AA778711 


Hs.4310 


eukaryotic translation initiation factor 


6.9 


111392 


W46342 


Hs.325081 


Homo sapiens, clone IMAGE:3659680, mRNA, 8.4 


111937 


BE298665 


Hs.14846 


Homo sapiens mRNA; cDNA DKFZp564D016 


(fr 10.6 


112244 


AB029000 


Hs.70823 


KIAA1077 protein 


14.6 


112995 


AA737033 


Hs.7155 


ESTs, Moderately similar to 21 15357A TYK 


5.6 


113777 


BE266947 


Hs.10590 


zinc finger protein 313 


13.4 


113791 


AI269096 


Hs.1 35578 


chitobiase, di-N-acetyl- 


1.3 


113811 


BE207480 


Hs.6994 


Homo sapiens cDNA: FLJ22044 fis, clone H 


3.1 


113834 


T26483 


Hs.6059 


EGF-containing fibulin-like extracellula 


11.3 


113868 


W57902 


Hs.90744 


proteasome (prosome, macropain) 26S subu 


2.7 


113870 


AL079314 


Hs.16537 


hypothetical protein, similar to (U06944 


6.1 


113923 


AW953484 


Hs.3849 


hypothetical protein FLJ22041 similar to 


1.9 


114275 


AW515443 


Hs.306117 


KIAA0306 protein 


15.8 


114895 


AA236177 


Hs.76591 


KIAA0887 protein 


7.1 


114965 


AI733881 


Hs.72472 


BMP-R1B 


2.3 


115061 


AI751438 


Hs.41271 


Homo sapiens mRNA full length insert cDN 


11.8 
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115278 AK002163 Hs.301724 

115291 BE545072 Hs. 122579 

115652 BE093589 Hs.38178 

115693 AF231023 Hs.55173 

115941 AI867451 Hs.46679 

115968 AB037753 Hs.62767 

116011 AL359053 Hs.57664 

116417 AW499664 Hs.12484 

116470 AI272141 Hs.83484 

116637 AK001043 Hs.92033 

117132 AI393666 Hs.42315 

117881 AF161470 Hs.260622 

118528 AI949952 Hs.49397 

119075 M10905 Hs.287820 

119265 BE539706 Hs.285363 

119349 T65004 Hs.163561 

119403 AL1 17554 Hs. 119908 

119789 BE393948 Hs.50915 

120206 H26735 Hs.91668 

120253 AA131376 Hs.326401 

120297 AA191384 Hs.104072 

120325 AA195651 Hs.104106 

120327 AK000292 Hs.278732 

120349 AW969481 Hs.55189 

120356 AF000545 Hs.296433 

120371 AA219305 Hs.104196 

120383 AL109963 Hs.123122 

120386 AW969665 Hs.154848 

120389 AW967985 Hs.325572 

120396 AA134006 Hs.79306 

120418 AW966893 Hs.26613 
120472 AI950087 

120484 AA253170 Hs.96473 

120570 AA280679 Hs.271445 

120582 BE244830 Hs.284228 

120596 AA282074 Hs.237323 

120624 AW407987 Hs.173518 
120695 AA976503 

120713 AW449855 Hs.96557 

120750 AI191410 Hs.96693 

120774 AI608909 Hs.193985 

120807 AA346385 Hs.30002 
120809 AA346495 

120984 BE262951 Hs.99052 

121081 M398721 Hs.186749 

121408 AA406137 Hs.98019 

121505 AA494172 Hs.194417 

121508 AA402515 Hs.97887 

121513 M416653 Hs.181510 

121549 AA412477 Hs.98142 
121558 AA412497 

121655 AA421537 Hs.178072 

121744 AA398784 Hs.97514 

121748 BE536911 Hs.234545 

121773 AB033022 Hs.1 58654 

121832 AW340797 Hs.98434 

121839 AA425691 Hs.191606 

121882 AA426376 Hs.98459 
121911 AA427950 

121999 AA430211 Hs.98668 

122013 AA431085 Hs.98706 

122036 W92142 Hs.271963 

122356 M443794 Hs.98390 

122371 AA868555 Hs.178222 

122372 AA446008 Hs.336677 
122460 AW418788 Hs.99148 
122490 AA448349 Hs.238151 



hypothetical protein FLJ11301 1.5 

hypothetical protein FLJ10461 6.2 

hypothetical protein FLJ23468 10.6 

cadherin, EGF LAG seven-pass G-type rece 6.8 

hypothetical protein FLJ20739 5.5 

KIAA1332 protein 9.8 

Homo sapiens mRNA full length insert cDN 2.4 

Human clone 23826 mRNA sequence 7.4 

SRY (sex determining region Y)-box 4 2.1 

integrin-linked kinase-associated serine 2.7 

p10-binding protein 5.2 

butyrate-induced transcript 1 5.7 

ESTs 7.4 

fibronectin 1 5.7 

ESTs 1.4 

ESTs 8.4 

nucleolar protein NOP5/NOP58 6.7 

kallikrein 5 (KLK5; KLK-L2; stratum com 9.2 
Homo sapiens clone PP1498 unknown mRNA 45.7 

fibroblast growth factor 12B 38.9 

ESTs, Weakly similar to Z195_HUMAN ZINC 15.2 

ESTs 6.4 

hypothetical protein FLJ20285 16.1 

hypothetical protein 16.8 

putative purinergic receptor 28.1 

EST 12.4 

FSH primary response (LRPR1, rat) homolo 9.7 

hypothetical protein DKFZp434D0127 32.6 

ESTs, Moderately similar to ALU7_HUMAN A 21.7 

eukaryotic translation initiation factor 12.5 
Homo sapiens mRNA; cDNA DKFZp586F1323 (f 1 1 .4 
gb:wq05c02.x1 NCI_CGAP_Kid12Homosapien 19.4 

EST 10.4 
ESTs, Weakly similar to ALU1.HUMAN ALU S 14.4 

ZNF135-like protein 10.2 

N-acetylglucosamine-phosphate mutase 7.5 

M-phase phosphoprotein homolog 52 
gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens 46.8 

Homo sapiens cDNA FLJ12727 fis, clone NT 5.9 

ESTs, Moderately similar to 21 09260A Be 7 

ESTs 7.8 

SH3-containing protein SH3GLB2; KIAA1848 6.8 

gb:EST52657 Fetal heart II Homo sapiens 4.4 

ESTs 5.6 

ESTs, Highly similar to I37550 mismatch 5.4 

EST 6 

ESTs 13.1 

ESTs 28 

ESTs 6.2 

EST 7.4 

gb:zt95g12.s1 Soares_testis_NHT Homo sap 2.8 
Homo sapiens mRNA; cDNA DKFZp434B1023 (f 7.8 

ESTs 7.1 

hypothetical protein NUF2R 19.5 

KIAA1 196 protein 7.9 

ESTs 5.8 

ESTs, Highly similar to KIAA1 048 protein 5 

ESTs 5 

gb:zw50f02.s1 Soares_total_fetus_Nb2HF8_ 7.2 

EST 6.4 

ESTs 6.5 
ESTs, Weakly similar to ALU5JHUMAN ALU S 13.1 

ESTs 7.3 

ESTs 5 

EST 7.6 

ESTs, Weakly similar to S43569 R01 H1 0.6 9.7 

EST 6.1 
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122492 
122510 
122530 
122572 
122607 
122614 
122616 
122618 
122622 
122717 
122829 
122838 
122856 
122868 
122907 
123016 
123034 
123136 
123152 
123394 
123466 
123486 
123615 
123735 
123753 
124006 
124385 
124440 
124656 
124683 
124735 
124761 
124768 
124788 
124811 
124812 
124822 
124860 
124903 
124930 
124942 
125051 
125056 
125101 
125115 
125280 
127274 
128528 
128670 
128691 
128772 
128781 
128797 
128868 
128891 
128946 
128975 
128995 
129019 
129076 
129088 
129096 
129198 
129347 
129362 
129372 
129404 



AA448417 Hs.104990 
AA449232 Hs.99195 
AW959741 Hs.40368 
AA452601 Hs.99287 
AA453518 Hs.98023 
M453630 Hs.99339 
AA453638 Hs.161873 
AA453641 

AA453987 Hs.144802 
AA456859 Hs.1 78358 
AW204530 Hs.99500 
AA460584 Hs.334386 
AI929374 Hs.75367 
AF005216 Hs.1 15541 
AA470074 Hs.1 69896 
AW338067 Hs.323231 
AL359571 Hs.44054 
AW451999 Hs.194024 
AW601773 Hs.270259 
AA731404 Hs.105510 
AA599042 Hs.1 12503 
BE019072 Hs.334802 
AA609170 

NM_013241Hs.95231 
AA609955 Hs.234961 
AI147155 Hs.270016 
AI267847 

AA532519 Hs.1 29043 
AW297702 Hs.102915 
AA381661 Hs.119878 
R22952 Hs.268685 
AA374756 Hs.93560 
AW368528 Hs.100855 
R43543 Hs.100912 
R46068 
R47948 
AA418160 Hs.86043 
R65763 Hs.101477 
AW296713 Hs.221441 
AI076343 Hs.173939 



Hs.288912 
Hs.188732 



R99978 

T79956 

T81310 

AI472068 

T97341 

AH 23705 



Hs.268892 
Hs.1 00588 
Hs.1 00592 
Hs.286236 

Hs.1 06932 
AW966158 Hs.58582 
R39234 Hs.251699 
AA975486 Hs.103441 
W27939 Hs.1 03834 
BE302796 Hs.105097 
N71826 Hs.105465 
NM_002975Hs.105927 
AA419008 Hs.106730 



F34856 
Y13153 



Hs.292457 
Hs.107318 



BE560779 Hs.284233 

AI816224 Hs.1 07747 
AI950087 

AW296806 Hs.326234 

AA744610 Hs.1 94431 

AA463189 Hs.288906 

N57532 Hs.1 09315 

BE614192 Hs.279869 

U30246 Hs.1 10736 
NM_016039Hs.1 10803 

AI267700 Hs.317584 



ESTs 5.4 

ESTs 11.2 

adaptor-related protein complex 1 , sigma 10.1 

EST 11 

ESTs 61.5 

EST 10.7 

ESTs 107.3 

gb:zx48e06.s1 Soares_testis_NHT Homo sap 31.1 

ESTs 5.6 

ESTs 8.5 

ESTs 81.8 

ESTs 75.3 

Src-like-adapter 5.8 

Janus kinase 2 (a protein tyrosine kinas 5.3 

ESTs 11.5 

Homo sapiens cDNA FLJ11946 fis, clone HE 2.8 

ninein (GSK3B interacting protein) 8.7 

ESTs 5.1 

ESTs 5.2 

ESTs 3.6 

EST 7.4 

Homo sapiens cDNA FLJ 14680 fis, clone NT 2.4 

gb:af12a12.s1 Soares_testis_NHT Homo sap 7.8 

FH1/FH2 domain-containing protein 10 

Huntingtin interacting protein E 30.6 

ESTs 8.1 

gb:aq49a10.x1 Stanley Frontal NB pool 2 57.1 
Human DNA sequence from clone 989H1 1 on 7.8 

ESTs 8.3 
ESTs, Weakly similar to M3K9_HUMAN MITOG 7.9 

ESTs 11.3 

Homo sapiens mRNA for KIAA1 771 protein, 9 

ESTs 8.1 

Homo sapiens cDNA: FLJ22726 fis, clone H 5.1 

hypothetical protein FLJ22604 14.2 

ESTs 7.9 

Homo sapiens cDNA FLJ13558 fis, clone PL 6.6 

EST 23.9 

ESTs 32.4 

ESTs, Weakly similar to ALUB_HUMAN !!!! 22.8 

ESTs, Moderately similar to B34087 hypot 6.1 

EST 135.3 

ESTs 5.4 

KIAA1856 protein 5.6 

gb:ye57e05.s1 Soares fetal liver spleen 9.6 

ESTs 8 

Homo sapiens cDNA FLJ12789 fis, clone NT 12.8 

ESTs, Weakly similar to IDN4-GGTR14 [H.s 2.8 

Homo sapiens, Similar to RIKEN cDNA 1700 7.1 

hypothetical protein MGC5576 7.7 

thymidine kinase 1 , soluble 5.3 

small nuclear ribonucleoprotein polypept 53.9 

stem cell growth factor; lymphocyte seer 1 3.3 

chromosome 22 open reading frame 3 3 
Homo sapiens, clone MGC:16362, mRNA, com 13.3 

kynurenine 3-monooxygenase (kynurenine 3 7.2 

NICE-5 protein 14 

DKFZP566C243 protein 1.9 
gb:wq05c02.x1 NCI_CGAP_Kid12Homosapien 2.9 

ESTs, Highly similar to T46422 hypotheti 5 

palladin 17.1 

WW Domain-Containing Gene 20.9 

KIAA1415 protein ~ 5.8 

melanoma-associated antigen recognised b 7.6 

solute carrier family 1 2 (sodium/potassi 6.7 

CGI-99 protein 2 

ESTs 5 
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129482 AA188185 Hs.289043 
129559 W01296 Hs.11360 
129587 H14718 Hs.11506 
129629 AK000398 Hs.11747 
129649 AD000092 Hs. 16488 
129680 U03749 
129689 AW748482 Hs.77873 
129702 AI304966 Hs. 12035 
129720 AA156214 Hs.12152 
130010 AA301116 Hs.142838 
130097 AL046962 Hs.14845 
130135 AA311426 Hs.21635 
130211 NM_003358Hs.23703 
130242 X79201 Hs.153221 
130359 NM_013449Hs.277401 
130365 W56119 Hs.155103 
130448 BE513202 Hs.15589 
130455 D90041 Hs. 155956 
130471 AL1 21438 Hs. 183706 
130503 BE208491 Hs.295112 
130511 L32137 Hs.1584 
130542 U64675 Hs. 179825 
130553 AF062649 Hs.252587 
130556 AI907018 Hs. 15977 
130567 AA383092 Hs.1608 
130574 AF083208 Hs.16178 
130617 M90516 Hs.1674 
130667 BE246961 Hs.17639 
130693 R68537 Hs. 17962 
130744 H59696 Hs. 18747 
130757 AL036067 Hs. 18925 
130880 BE514434 Hs.20830 
130944 BE382657 Hs.21486 
131046 AA321649 Hs.2248 
131060 AA194422 Hs.22564 
131099 AL133353 Hs.226581 
131135 NM_016569Hs.267182 
131185 BE280074 Hs.23960 
131225 H62087 Hs.31659 
131245 AL080080 Hs.24766 
131283 X80038 Hs.339713 
131569 AL389951 Hs.271623 
131643 AW410601 Hs.30026 
131714 AA642831 Hs.31016 
131722 D13757 Hs.311 
131760 X76732 Hs.3164 
131793 AW966127 Hs.32246 
131885 BE502341 Hs.3402 
131900 AA099014 Hs.231029 
131905 AA179298 Hs.3439 
131941 BE252983 Hs.35086 
131971 BE567100 Hs.154938 
132180 NM_004460Hs.418 
132203 NM_004782Hs.194714 
132273 AA227710 Hs.43658 
132288 N36110 Hs.305971 
132294 AB023191 Hs.44131 
132348 AW067708 Hs. 170311 
132370 AW572805 Hs.46645 
132384 AA312135 Hs.46967 
132450 AA100012 Hs.48827 
132465 AW169847 Hs.49169 
132532 AA454132 Hs.5080 
132574 AW631437 Hs.5184 
132638 AI796870 Hs.54277 
132718 NM_004600Hs.554 
132726 N52298 Hs.55608 



spindlin 6.7 

hypothetical protein FU14784 7.5 

Human clone 23589 mRNA sequence 6.8 

hypothetical protein FLJ20391 3.8 

calreticulin 3.3 
gb:Human chromogranin A (CHGA) gene, pro 14.1 

B7 homolog 3 2.6 

ESTs, Weakly similar to I38022 hypotheti 7.4 

APMCF1 protein 2 

nucleolar phosphoprotein Nopp34 1.6 

forkhead box 03A 2.8 

tubulin, gamma 1 6.1 
ESTs, Moderately similar to CEGT.HUMAN C1.6 

synovial sarcoma, translocated to X chro 5.4 

bromodomain adjacent to zinc finger doma 8.5 

eukaryotic translation initiation factor 1 1 

PPAR binding protein 3.9 

N-acetyltransferase 1 (arylamine N-acety 33.6 

adducin 1 (alpha) 2.7 

KIAA0618 gene product 16.1 

cartilage oligomeric matrix protein (pse 6.1 

RAN binding protein 2-like 1 7.8 

pituitary tumor-transforming 1 14.4 

Empirically selected from AFFX single pr 4.7 

replication protein A3 (14kD) 7.9 

apoptosis antagonizing transcription fac 1 .2 

glutamine-fructose-6-phosphate transamin 12.1 

Homo sapiens ubiquitin protein ligase (U 13.9 

ESTs 2 

POP7 (processing of precursor, S. cerevi 3.1 

protein x 0001 5.7 

kinesin-like 2 2.1 

signal transducer and activator of trans 5.4 

small inducible cytokine subfamily B (Cy 7.4 

myosin VI 5.1 

COX1 5 (yeast) homolog , cytochrome c oxid 7 

TBX3-iso protein * 3.3 

cyclin B1 5.8 

thyroid hormone receptor-associated prot 7.5 

thioredoxin domain-containing 2.8 
Homo sapiens clone F19374 APO E-C2 gene 1 .3 

nucieoporin 50kD 5 

HSPC182 protein 2.9 

putative DNA binding protein 2.9 

phosphoribosyl pyrophosphate amidotransf 3.4 

nucleobindin 2 2.9 

Homo sapiens cDNA FU14656 fis, clone NT 7.9 

ESTs 13.7 
Homo sapiens, clone MGC:15961, mRNA, com 8.7 

stomatin-like 2 11.3 

ubiquitin specific protease 1 2.3 

hypothetical protein MDS025 3.5 

fibroblast activation protein, alpha 14.7 

synaptosomal-associated protein, 29kD 7.8 

DKFZP586L151 protein 10 

solute carrier family 2 (facilitated glu 9.2 

KIAA0974 protein 2 

heterogeneous nuclear ribonucleoprotein 12.5 

ESTs 28.3 

HSPC034 protein 6.1 

hypothetical protein FLJ12085 8.6 

KIAA1634 protein 6.1 

mitochondrial ribosomal protein L16 7.1 

TH1 drosophila homolog 14 
DNA segment on chromosome X (unique) 99212.4 

Sjogren syndrome antigen A2 (60kD, ribon 3.7 

hypothetical protein MGC955 14.3 
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132731 


AI189075 


Hs.301872 


hypothetical protein MGC4840 


5.9 




119744 

1 OCt Tt 


AA010233 


Hs.55921 


glutamyl-prolyl-tRNA synthetase 


6.4 




1 19771 
I t to 


M459713 


Hs.295901 


KIAA0493 protein 


14.6 




1 32798 


AI026701 


Hs.5716 


KIAA0310gene product 


2.5 


5 


119810 


AB007944 


Hs.5737 


KIAA0475 gene product 


4.2 




1 19811 


U78525 


Hs.57783 


eukaryotic translation initiation factor 


6.1 




132842 


NM_016154Hs.279771 


Homo sapiens clone PP1596 unknown mRNA7.1 




1 19851 


U09716 


Hs.287912 


lectin, mannose-binding, 1 


6.1 


10 


132891 


BE267143 


Hs.59271 


U2(RNU2) small nuclear RNA auxiliary fac 


2.7 


119041 


AI817165 


Hs.6120 


hypothetical protein FLJ13222 


2.1 




11907? 

\0£.St 


AA034365 


Hs.288924 


Homo sapiens cDNA FLJ1 1392 fis, clone HE 3.5 




119080 


AA040696 


Hs.62016 


ESTs 


1.3 




119004 


AA112748 


Hs.279905 


clone HQ0310PRO0310p1 


17.1 


15 


133016 


AI439688 


Hs.6289 


hypothetical protein FLJ20886 


4.4 


133177 

< OO 1 / / 


X97795 


Hs.66718 


RAD54 (S.cerevisiae)-like 


4.4 




133208 


AI801777 


Hs.6774 


ESTs 


5.5 




133254 


AI567421 


Hs.273330 


Homo sapiens, clone IMAGE:3544662, mRNA, 1 .3 


i e 


133266 


Al 160873 


Hs.69233 


zinc finger protein 


16.1 




133268 


AW956781 


Hs.293937 


ESTs, Weakly similar to FXD2JHUMAN FORKH 12.2 


So 




M76477 


Hs.289082 


GM2 ganglioside activator protein 


10.4 




133300 


AI950382 


Hs.72660 


phosphatidylserine receptor 


5.7 


in 


111101 

1 000& 1 


AW1 03364 


Hs.727 


inhibin, beta A (activin A, activin AB a 


25.5 


:rr : : 


111*40 


AL037159 


Hs.74619 


proteasome (prosome, macropain) 26S subu 1 .7 


: ; : 




AW1 60781 


Hs. 172589 


nuclear phosphoprotein similar to S. cer 


2.6 




1 11R91 


NM_004893Hs.75258 


H2A histone family, member Y 


13.5 




1 11790 
loot AJ 


L27841 


Hs.75737 


pericentriolar material 1 


6.7 




1 11780 


BE271766 


Hs. 181 357 


laminin receptor 1 (67kD, ribosomal prat 


5.4 




1117A4 


BE622743 


Hs.301064 


arfaptin 1 


12.1 




1 117Q1 

i ooiu i 


M34338 


Hs.76244 


spermidine synthase 


9.7 


J30 


1 *}T7Q7 

loo/y/ 


AL133921 


Hs.76272 


retinoblastoma-binding protein 2 


1.3 




1 11899 


D50525 


Hs.699 


peptidylprolyl isomerase B (cyclophilin 


9.7 






W29092 


Hs.7678 


cellular retinoic acid-binding protein 1 


4.2 




1 1188"; 

1 00000 


AB011155 


Hs.170290 


discs, large (Drosophila) homolog 5 


5 




looooi 


U30872 


Hs.77204 


centromere protein F (350/400kD, mitosin 


9.1 




1 o\>yz4 


D86326 


Hs.325948 


vesicle docking protein p115 


1.8 




1 11Qi\Q 


X81789 


Hs.77897 


splicing factor 3a, subunit 3, 60kD 


10.4 




i OOQOQ 


AL040328 


Hs.78202 


SWI/SNF related, matrix associated, acti 


2.6 




io\>yy/ 


AI824113 


Hs.78281 


regulator of G-protein signalling 12 


13 


40 




BE300078 


Hs.80449 


Homo sapiens, clone IMAGE:3535294, mRNA, 10.3 




AW291946 Hs.82065 


interieukin 6 signal transducer (gp130, 


6.7 




11417R 
lj*fOYO 


X06560 


Hs.82396 


2',5'-oligoadenylate synthetase 1 (40-46 


5.5 




1040/3 


AW362124 Hs.323193 


hypothetical protein MGC3222 


5.8 




11441V; 


AW067903 Hs.82772 


collagen, type XI, alpha 1 


72.9 


45 


114491 


AU077196 


Hs.82985 


collagen, type V, alpha 2 


6.7 


114480 


NM_005000Hs.83916 


Empirically selected from AFFX single pr 


6.2 




104010 


AK001571 


Hs.273357 


hypothetical protein FU10709 


1.4 




114590 

1 Ot OZ;7 


AW411479 


Hs.848 


FK506-binding protein 4 (59kD) 


2.8 




104/Ot 


AW630803 


Hs.89497 


lamin B1 


6.1 


50 


1 j4/yu 


BE002798 


Hs.287850 


integral membrane protein 1 


1.2 


114808 
1040U0 


AD001528 


Hs.89718 


spermine synthase 


2.6 




114ftlV* 


AI701162 


Hs.90207 


hypothetical protein MGC1 1 1 38 


9.1 




114850 


D26488 


Hs.90315 


KIAA0007 protein 


13.3 




134971 


AI097346 


Hs.286049 


phosphoserine aminotransferase 


2 


55 


135181 


BE250865 


Hs.279529 


px19-like protein 


14.9 


135207 


N26427 


Hs.9634 


ESTs, Highly similar to C10_HUMAN PUTAT1 1.7 




135245 


AI028767 


Hs.262603 


ESTs 


12.2 




135257 


AW291023 


Hs.97255 


ESTs, Weakly similar to A46010 X-linked 


7.6 




135307 


AI743770 


Hs.98368 


ESTs, Weakly similar to KIM0822 protein 


5.8 


60 


135321 


AI652069 


Hs.98614 


ribosome binding protein 1 (dog 180kD ho 


12.3 


135354 


AA456454 


Hs.183418 


cell division cycle 2-like 1 (PITSLRE pr 


5.7 




135400 


X78592 


Hs.99915 


androgen receptor (dihydrotestosterone r 


13.9 




302276 


AW057736 


Hs.323910 


HER2 receptor tyrosine kinase (c 


5.3 




317781 


NM_007057Hs.42650 


ZW10 interactor 


2.8 


65 


321114 


AA902256 


Hs.78979 


Gokji apparatus protein 1 


5.5 


322556 


BE041451 


Hs.177507 


hypothetical protein 


2.9 




420802 


U22376 


Hs.1334 


v-myb avian myeloblastosis viral oncogen 


2.3 




424001 


W67883 


Hs.1 37476 


paternally expressed 10 (PEG10; KIAA105 


7 
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425182 


AF041259 


Hs.155040 


zinc finger protein 217 


2.3 


446999 


AA151520 


Hs.334822 


hypothetical protein MGC4485 


7.5 


450701 


H39960 


Hs.288467 


Homo sapiens cDNA FLJ12280 f 


is, clone MA 5.6 


452461 


N78223 


Hs.108106 


transcription factor 


4.7 


453157 


AF077036 


Hs.31989 


DKFZP586G1722 protein 


12.1 
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TABLE 5A 



Table 5A shows the accession numbers for those pkeys lacking unigenelD's for Table 5. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

123615 30686_-15 AA609170 
124385 656394J AI267847 N27351 

110856 19346J4 AA992380 N33063 N21418 H79958 R21911 H79957 

120472 44573_2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
M908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 M927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 M282915 AW102898 AI872193 AI763273 AW173586 
AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 M044209 AW856538 AA180009 AA337499 AW961 101 
AA251669 M251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 
AW613456 AI373032 AI564269 F00531 H83488 W371 81 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 

129019 44573_2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 M927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA488964 M283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 V AI872193 AI763273 AW173586 
AW1 50329 AI653832 AI762688 M988777 AA488892 AI356394 AW1 03813 AI539642 AA642789 M856975 AW505512 
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961 101 
AA251669 M251874 A1819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 

120695 9683_3 M976503 AI917802 M953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 
AI796100 AI935603 AW052210 AA970201 AI633384 M425910 AI017004 AI241295 M402816 AA291468 

122618 305217.1 AA453641 AA454061 

125115 genbank_T97341 T97341 

120809 genbank_M346495 AA346495 

129680 23162.1 U03749 NM_001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 AI560122 
AI858999 D55958 AI684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441 
AW043898 AI969102 M405741 AI091983 AA788784 AA706586 AA854361 AW470949 M843095 AA772028 AI148432 
AI038109 AA782478 AA910064 AI220384 AA781296 AA843881AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 AI584185 C05724 M789023 AI686818 D54392 AI022485 AA431410 M854232 W39212 W15214 
AA894441 AI803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304 
AA976699 AI687001 AI621 107 AI865540 AA772107 C06286 AA319661 AA405992 

101045 entrez_J05614J05614 

110501 genbank_H55748 H55748 

121558 genbank_AA412497 AA412497 

121911 genbank_AA427950 M427950 
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TABLE 6: Figure 6 from BRCA 001 US 



Table 6 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

10 Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal breast tissue 



^ Pkey ExAccn UnigenelD UnigeneTitle R1 

H? 100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 7.5 

p 100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 

p, 101806 AA586894 Hs.1 12408 S100 calcium-binding protein A7(psorias 8.9 

~~ 102455 U48705 Hs.75562 discoidin domain receptor family, member 6.9 

\M0 103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 

fin 105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (Ig), 2.6 

£T! 105746 AW1 51 952 Hs.46679 hypothetical protein FLJ20739 1.5 

'•V 106373 AW503807 Hs.21907 histone acetyltransferase 1.8 

%! 110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2 

fz5 119260 AK001724 Hs.102950 coat protein gamma-cop 3.2 

120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7 

* 120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9 

Q 120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195JHUMAN ZINC 15.2 

t=. 120624 AW407987 Hs.173518 M-phase phosphoprotein homotog 52 

!30 120695 AA976503 gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens 46.8 

fy 120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1 848 6.8 

121508 AA402515 Hs.97887 ESTs 28 

i. 122607 AA453518 Hs.98023 ESTs 61.5 

?s = : 122616 AA453638 Hs.161873 ESTs 107.3 

f"B5 122618 AA453641 gb:zx48e06.s1 Soares_testis_NHT Homo sap 31.1 

122829 AW204530 Hs.99500 ESTs 81.8 

122838 AA460584 Hs.334386 ESTs 75.3 

123753 AA609955 Hs.234961 Huntingtin interacting protein E 30.6 

124385 AI267847 gb:aq49a10.x1 Stanley Frontal NB pool 2 57.1 

40 124860 R65763 Hs. 101477 EST 23.9 

124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB_HUMAN !!!! 22.8 

125051 T79956 Hs.100588 EST 135.3 

128781 N71826 Hs.105465 small nuclear ribonucleoprotein polypept 53.9 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9 

45 129347 BE614192 Hs.279869 melanoma-associated antigen recognised b 7.6 

129689 AW748482 Hs.77873 B7homolog3 2.6 

130503 BE208491 Hs.295112 KIM061 8 gene product 16.1 

130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 6.1 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 7.4 

50 131643 AW410601 Hs.30026 HSPC1 82 protein 2.9 

131925 AF151048 Hs.183180 anaphase promoting complex subunit 1 1 (y 2.7 

132180 NM_004460Hs.418 fibroblast activation protein, alpha 14.7 

132370 AW572805 Hs.46645 ESTs 28.3 

132994 AA112748 Hs.279905 clone HQ0310 PRO0310p1 17.1 

55 133016 AI439688 Hs.6289 hypothetical protein FU20886 4.4 

133266 AI160873 Hs.69233 zinc finger protein 16.1 

133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB a 25.5 

134169 AI690916 Hs. 178 137 transducer of ERBB2, 1 1.2 

134219 NM_000402Hs.80206 glucose-6-phosphate dehydrogenase 1.9 

60 134405 AW067903 Hs.82772 collagen, type XI, alpha 1 72.9 

134529 AW41 1479 Hs.848 FK506-binding protein 4 (59kD) 2.8 

134975 R50333 Hs.92186 Leman coiled-coil protein 2.6 

135181 BE250865 Hs.279529 px19-like protein 14.9 

322556 BE041451 Hs.177507 hypothetical protein 2.9 
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TABLE 6A 

Table 6A shows the accession numbers for those pkeys lacking unigenelD's for Table 6. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

124385 656394J AI267847 N27351 

120695 9683_3 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603 

AW052210 AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468 
122618 30521 7_1 AA453641 M454061 
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TABLE 7: Figure 7 from BRCA 001-1 US 

Table 7 shows genes upregulated in tumor tissue compared to normal breast tissue. Open 
reading frames in the sequences have been characterized as having a signal sequence (SS), a 
transmembrane domain (TM) or other. 



10 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal breast tissue 

ORF struct info: Structural characterization of open reading frame for the sequence of the gene 





Pkey 


ExAccn 


UnigenelD 


UnigeneTitle 


R1 


ORF struct info 


y : 


100113 


NM_001269 


Hs.84746 


chromosome condensation 1 


2.3 


TM 


tto 


100114 


X02308 


Hs.82962 


thymidylate synthetase 


2.9 


other 


ipi ! 
3 


100131 


D12485 


Hs.11951 


ectonucleotide pyrophosphatase/phosphodi 


1.9 


other 




100146 


BE1 85499 


Hs.2471 


KIAA0020 gene product 


1.9 


TM 




100147 


D13666 


Hs.1 36348 


osteoblast specific factor 2 (fasciclin 


7.6 


other 


125 


100154 


H60720 


Hs.81892 


KIAA0101 gene product 


9.2 


other 


100163 


W44671 


Hs.124 


gene predicted from cDNA with a complete 


1.6 


other 




100220 


AW015534 


Hs.217493 


annexin A2 


2 


other 




100265 


D38521 


Hs.1 12396 


KIAA0077 protein 


1.5 


other 




100271 


BE1 60081 


Hs.256290 


S100 calcium-binding protein A1 1 (cakjiz 


13.5 


other 


~ : 


100275 


BE242802 


Hs.154797 


KIAA0090 protein 


5.1 


other 


! J0 


100323 


D50920 


Hs.23106 


KIAA0130 gene product 


1.9 


TM 




100335 


AW247529 


Hs.6793 


platelet-activating factor acetylhydrola 


2.7 


other 




100364 


NM 004341 


Hs.154868 


carbamoyl-phosphate synthetase 2, aspart 


2 


other 




100372 


NM 014791 


Hs.184339 


KIAA0175 gene product 


2.6 


other 




100393 


D84145 


Hs.39913 


novel RGD-containing protein 


3.2 


other 


35 


100400 


AW954324 


Hs.75790 


phosphatidylinositol glycan, class C 


1.5 


other 




100418 


D86978 


Hs.84790 


KIAA0225 protein 


2 


other 




100482 


M65028 ' 


Hs.81361 


heterogeneous nuclear ribonucleoprotein 


2.9 


other 




100518 


NM 004415 


Hs.74316 


desmoplakin (DPI, DPII) 


1.9 


other 


40 


100666 


L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


5.7 


other 


100667 


L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


9 


? 




100668 


L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


7.7 


other 




100678 


AW502935 


Hs.740 


PTK2 protein tyrosine kinase 2 


53.2 


other 




100783 


AF078847 


Hs.191356 


general transcription factor IIH, polype 


6 


other 


45 


100892 


BE245294 


Hs.180789 


S164 protein 


1.7 


? 


100945 


AF002225 


Hs.180686 


ubiquitin protein ligase E3A (human papi 


1.5 


other 




100969 


AA157634 


Hs.79172 


solute carrier family 25 (mitochondrial 


6.3 


other 




100988 


AK000405 


Hs.76480 


ubiquitin-like 4 


11.4 


? 




100999 


H38765 


Hs.80706 


diaphorase (NADH/NADPH) (cytochrome b-5 


1.6 


other 


50 


101031 


J05070 


Hs.151738 


matrix metalloproteinase 9 (gelatinase B 


8.4 


other 


101045 


J05614 




gb:Human proliferating cell nuclear anti 


5 


? 




101077 


N99692 


Hs.75227 


Empirically selected from AFFX single pr 


2.6 


other 




101093 


L06419 


Hs.75093 


procollagen-lysine, 2-oxoglutarate 5-dio 


1.4 


? 




101186 


AA020956 


Hs.179881 


core-binding factor, beta subunit 


2 


TM 


55 


101216 


AA284166 


Hs.84113 


cyclin-dependent kinase inhibitor 3 (CDK 


1.8 


other 


101228 


AA333387 


Hs.82916 


chaperonin containing TCP1, subunit 6A ( 


1.7 


TM 




101247 


AA132666 


Hs.78802 


glycogen synthase kinase 3 beta 


1.9 


other 




101249 


L18964 


Hs.1904 


protein kinase C, iota 


1.5 


other 




101332 


J04088 


Hs.156346 


topoisomerase (DNA) II alpha (170kD) 


5.3 


other 


60 


101352 


AI494299 


Hs.16297 


COX17 (yeast) homolog, cytochrome c oxid 


4.2 


other 


101396 


BE267931 


Hs.78996 


proliferating ceil nuclear antigen 


1.9 


TM 




101445 


M21259 




gb:Human Alu repeats in the region 5' to 


1.6 


TM 




101470 


NM 000546 


Hs.1846 


tumor protein p53 (Li-Fraumeni syndrome) 


2.5 


other 




101478 


NMJW2890 


Hs.758 


RAS p21 protein activator (GTPase activa 


5.5 


other 
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10 



15 



SO 



m 

o 



40 



45 



50 



55 



60 



65 



101483 


M?44fifi 


H<! 7fi7fifi 

Mi, / U/ UD 


rirn(Y^IIiin&n-n rnl inp 9j^vnnlntaratp <d-Hi 
ui uwjiiayci i ui umi ic, t^JAuyiuiaioLC *t^ji 


2.1 


other 


101540 


J04977 




Y-rav rpnair rnmnlpmpntinn Hpfprfiup rpn 


1.6 


other 


101573 


AW248421 




nmtftaenmp ^nmenmp mafrfinain^ euhu 

UI UlCCUUI 1 IC \fJ' vdUl 1 K3j 1 1 ICIU1U|JC1II 1 1 bVV OULfU 


5.7 


other 


IU IJQU 


lll¥l__U Ic i J 1 


MO.OOOUO 


rrtflnnfatifMi fertrtrVIII-fleeorifltpH /tntr 
uuoyutciuuii ioului vm cioouucticu ^iiiu 


1 8 


nthpr 


1fl1 

lu I Jot 


MrUO*tOuO 


Mc Q19QQ 


nnanino mir-loriiiHa hinHinn n r/^ta i n /fl rsr 
yUalllllt; IiUUcUUUc UUIUIIty UEUlClll yJ UE 


J.VJ 


9 




RF7Q1fln4 

DCOy lOUn 


rlo-DZDD I 


yuaiiyiaLB UMiuiiiy piutciii i, miichciuii- 




other 


1/117/19 
1 U I / UZ 


AW^fMflRQ 
Mvvau^fuoy 


Lie 170^74 


nr/itoin nhocnhatacp 0 ^frtnnprlu 9A\ rpn 

pi ulchi pi lubpi idLaoc z ^juniiciiy zrt/, icy 


I.O 


nthpr 

UUICI 


1 V I.I 0*T 


ivt/ s ruaa 


He 147040 
no. i H i uta 


\Ail l^UIUOUJJI llld^ lir\C 1 i UIOUIOUCIM 


£. I 


9 


1A17CQ 
lu 1/ OcJ 


MP/)944 

IVIOUZ*T*T 


no. lonuu I 


enlntp warrior famili/ 7 /r*fltinnir* aminci 
oUiUlC UctMlCI tail Illy l ^UaUUIIIL> all III IU 




TM 

1 I VI 




IVIO lUUi 


no. louooH 


/*ar(v%v\/npntiHaeo 131 /t!eenp\ 
valUUAyUcUUUclOc D 1 ^UooUtiJ 


144 


oo, 


101782 


AA306495 


Hs 


phosphocjlucomutsss 1 


5.2 


otfier 


IU IOUJ 


AW4fiQ7A7 


now 3D iz 


etrpee inrlii^prl nhnenhnnrntpin 1 /Hcn7n/M 

oueoo*iiiuuucu~uiiuouiiuuiui9iii i ynou/ u/n 


o.u 


nthpr 


1 U 1 uuu 


rt/AJQUOa*+ 


He 119A0A 
no. i ijttvo 


Qinn Mlrii im-hinHinn nrntpin A7 fnenriae 
o iuu udiuiui 1 1 un iuii ly uiuicin a/ ^uouiicio 


3 9 


SS,TM 


IU 10 IU 


l>ilVI_UUUO IO 


He 1ftnfi19 
no. iouo iz 


pciUAiouiiiai iiiciiiuidiie piuiciii o ^ojmj, eje 


^ 9 


TM 

1 IVI 


IU IU/ a 


Hir\ 1 f UOl *T 


rlg.LtJOOU 


ni irlpar antnantinpnir ^nprm nmfpin /hie: 
nuuicai ctuiuciiiuydiiu opcui i piuiciii ^iiio 


1.6 


other 


lu la l l 


AA4417P7 


He 11QRO.Q 
no. I laQOa 


nhfmnmfan h Armnri gc alnho rv\U/nantiHp 

giyuttprijicin iiurinuiicb, dipiid puiypcpuuc 


O 1 .0 


9 


I U I 3 ZU 


AF1P.9R4R 
ftr lOZuHu 


He fi(Y)A 
no.ouz*t 


ir\ uyiuwi ic, uuwu-icyuiaiui ui nun ii 


1 ft 


nthpr 

UUICI 


IU la/O 


1 141*14 


He fini9H 

ns.ou izu 


1 ImD M o/^qK/I alnha M-fial3i™tr\eamino*r\nlun 


9 A 


nthor 
ULI Eel 


1 fi9nno 

iuzuuj 


RF94514Q 
dczho i*ta 


He ft9AA7 


nmtoin turneinp Ifinsep Q 
piuictii lyiuoinc mi idoc a 


1 .0 


nthpr 

ULI ICI 


luzuoo 


QC9C0197 

□czou iz/ 


He fl9Qfl£ 

ns.ozyuD 


r*M/^9f\ fr*a\\ Hluielrtn r^/rOci 0C\ Q f*ara\n 

ouozu ^csii uivision cycic zu, c. ccrcvi 


Z 


9 


1 UZUO J 


I OOau I 


He 7A1 17 


int@rl©ukin snhsncBr binding fsctor 2, 4 


1 .0 


other 


10,91/17 

lUzlU/ 


DC9£flfi09 

btZOooUz 


He IflO'SCft 

ns.iozooo 


hGat shock protsin 75 


1 4 


other 


10,9193 
iuz izo 


MM OA 10:0,0 


He 

ns. ioot- 


csnirumero proiein m { i /kuj 


1 ft 
1 .0 


other 


lUzlao 


bcoiozou 


ns. loyoz/ 


deatii associated protein 3 




9 


io9ioo 


Awyouooz 


Ue 74CQQ 

ns./4oyo 


polymerase (DNA directed), delta 2, regu 


4 4 
4.4 


9 


lUZZ 1 I 


A AOOQQ7Q 

AAozyy/o 


He '3nif?17 

ns.ou io io 


JTV1 gene 


fi 7 
O./ 


other 


lUZZzU 


1 \OA ^QQ 

uz4ooy 


Ue C^47G 

ns. 00400 


lysosomal 


4 4 
4.4 


1 IVI 


1uzzo4 


awi oo jyu 


ns.z/ooo4 


heterochromatin-like protein 1 


•1 Q 

i.y 


1 IVI 


lUzZOU 


ai noa-104 
ALUOy 1U4 


Ue 1RORJT7 

ns.ioyoo/ 


karyopherin alpha 2 (RAG cohort 1, impor 


4 4 


other 


lUZoUz 


AA0U0o4z 


l_r_ fift^74 

ns.oyl/l 


protein kinase G-tike 2 


O 7 
Z./ 


O 

r 


lU^OOU 


DtzyoUDO 


HS./ /Z04 


chromobox homolog 1 (Drosophila HP1 beta 


1 c; 
1.0 


other 


luzooy 


BFT7A439 
DC0/O40Z 


Ue QCR77 

rts.yoo/ / 


cyclin-dependent kinase 4 


9 

Z.O 


1 IVI 


1UZ040 


1 I97K1G 

uo/o ny 


ns.o/ooy 


aldehyde dehydrogenase 3 family, member 


z 


I IVI 


luzony 


AI 10770,** 


He 9AQ1H7 

ns.zoa iu/ 


uacuiovirai iMr repedi-curiidiniiig z 


1 9 

o.z 


other 


luzooy 


uoyo4u 


Ue 9QQ0R7 

rts.zyyoo/ 


nepaiocyie nuciear lactor o, aipna 


o 
z 


other 


IUZO/4 


UJOOOj 


He Q/1R79 

ns.yuo/ z 


PTK7 protein tyrosine kinase 7 


fi 9 
O.Z 


other 


in9*5Q1 


A A9Gfi074 
AAZyDo/4 


Ue 77404 

ns.//4y4 


deoxyguanosine Idnase 


1 .0 


I tvi 


1 Uz4bD 


U4o/U0 


HS./oooz 


discoidin domain receptor family, member 


7 

/ 


other 


lUZHOO 


NM_UUloOy 


Ue fllCA'Q 

nS.olo4o 


2,4-dienoyl CoA reductase 1, mitochondri 


1 9 

l.O 


OO, 


IUZ300 


uouyoy 


Ue R1fl9ft 

ns.o iozo 


amyloid beta precursor protein-binding p 


1 .0 


9 


luz^oy 


ai AOOj11£ 
ALUoUl IO 


Ue 744 Ofl 

ns./4**zu 


origin recognition complex, subunit 3 (y 


0.0 


other 


1 Uz4y4 


AM 001*57 
All 0010/ 


Ue 7C1QQ 

MS./oiyo 


uupy nomoiog 


Z. I 


other 


109^01 

IUZ0U I 


AF9171Q7 
HrZ lila/ 


He 74*\ft9 

ns./ hooz 


siah binding protein 1 \ FBP interacting 


9 
o.z 


other 


lUZDZZ 


DbzOUo44 


nS.ioooob 


solute carrier family 1 (neutral amino a 


9 Q 
Z.o 


9 


luzooz 


ArU4Uzoo 


HS.701ob 


suppressor of Ty (S.cerevisiae) 5 homolo 


C 7 
O./ 


9 


m9£fi4 
IUZ004 


uoy4zo 


Ue 70/i(?7 

ns./yuo/ 


MAD (mothers against decapentaplegic, Dr 


9 "5 
Z.O 


other 


lUzobo 


\AJQ-1 /on 

woi4oy 


nS.zzoUzo 


RAB31, member RAS oncogene family 


0.0 


other 


lUZOOU 


i icnono 
UoUoUo 


Ue 1COQQ1 

nS.lOzyoi 


CDP-diacylglyoerol synthase (phosphatida 


9 1 
Z. I 


other 


lUZDol 


Al m7799ft 
AUU/ /ZZo 


He 779*;R 

ns./ /zoo 


enhancer of zeste (Drosophila) homolog 2 


1 fi 
l.O 


9 


lUzooz 


1 IC'IO'JO 

Ublzoz 


Ue *JOC7K 

nS.ozo/0 


tubul in-specific chaperone e 


9 1 
Z.I 


other 


lUZQlr 


AWI 01450 


Ue 1QQ7R7 

HS. iyo/0/ 


C0P9 (constitutive photornorphogenic, Ara 


1 0 
1.0 


other 


102618 


AL037672 


Hs.o1071 


extracellular matrix protein 1 


0.0 


ouier 


1 02627 




H« 1*ift174 
ns. i w i / *+ 


ui iu niiycj piuiciu io*t ^rxi uppci iii\cj 


1.3 


other 


102663 


NM 002270 


Hs.168075 


karyopherin (importin) beta 2 


18 


TM 


102676 


BE262989 


Hs.12045 


putative protein 


2.3 


other 


102687 


NM 007019 


Hs.93002 


ubiquitin carrier protein E2-C 


4.4 


? 


102689 


U96132 


Hs.171280 


hydroxyacyl-Coenzyme A dehydrogenase, ty 


6 


? 


102696 


BE540274 


Hs.239 


forkhead box M1 


4.2 


other 


102704 


AU077058 


Hs.54089 


BRCA1 associated RING domain 1 


1.9 


other 


102705 


T97490 


Hs.50002 


small inducible cytokine subfamily A (Cy 


2.3 


SS.TM 


102750 


AB014460 


Hs.66196 


nth (E.coli endonuclease ill)-like 1 


1.2 


TM 


102801 


BE252241 


Hs.38041 


pyridoxal (pyridoxine, vitamin B6) kinas 


6.5 


other 


102812 


U90549 


Hs.236774 


high-mobility group (nonhistone chromoso 


1.6 


other 


102827 


BE244588 


Hs.6456 


chaperonin containing TCP1, subunit 2 (b 


5.6 


TM 


102844 


AV653790 


Hs.324275 


WW domain-containing protein 1 


1.3 


TM 


102868 


X02419 


Hs.77274 


plasminogen activator, urokinase 


4.4 


other 


102925 


BE440142 


Hs.2943 


signal recognition particle 19kD 


1.9 


other 
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102935 


BE561850 


Hs.80506 


small nuclear ribonucleoprotein polypept 


2.4 


? 




1 02968 


AU076611 


Hs. 154672 


methylene tetrahydrofolate dehydrogenase 


2.7 


other 




102983 


BE387202 


Hs. 118638 


non-metastatic cells 1, protein (NM23A) 


3.1 


other 




109<W5 


IJ95742 


Hs.2707 


R1 to S nhac^ transition 1 

\J 1 vv *J Wl ICIOC U Ol IOIUV/II 1 


5.2 


? 


5 


103023 


AW500470 


Hs. 117950 


multifunctional polypeptide similar to S 


1.6 


other 




103038 


AA926960 


Hs 334883 


CDC28 protein kinase 1 


2.5 


TM 




103060 


NM 005940 

HI It n UUU^^U 


Hs 155324 


matrix metalloproteinase 11 (MMP11; stro 


4.5 


other 




103080 

1 UJVOV 


Al 1077931 


Hs 89932 


rvrlin D1 ^PRAD1 ' narathvrnid adpnomatos 

uy viii 1 l> 1 11 1 \rvL/ 1 . uaiau iji uiu uuvi iui i ioluo 


3.1 


other 


10 


103089 

1 UJUUJ 


D31152 


Hs 179729 


collagen, type X, alpha 1 (Schmid metaph 


2.4 


other 


103177 


BE244377 


Hs 48876 


famesyl-diphosphate famesyltransferase 


3.5 


other 




103178 

I U J 1 / o 






rihnQnmfll nrotpin S18 

llUUOUMICIl JJIUICHI UlU 


9.9 


? 




103179 


NM 001777 

\J\J III! 


Hs 82685 


CD47 antigen (Rh-related antigen, integr 


1.3 


other 




103181 


X69636 


Hs 334731 


Homo sapiens, clone IMAGE:3448306, mRNA, 


2 


other 


15 


IUO IOJ 


mm nnfiAO^ 


He 7A1RR 


franemfamhrana nrYYtpin ffi^kR^ pnrinnlaemi 
Uai iomici iiuiai ic (Jiulcim \\j^jt\u), cnuupiaoiiii 


1.6 


other 


103191 


fWty IVU^J 


Hs 9903 


protein phosphatase 4 (formerly X), cata 


2.5 


other 




103193 


MM fl047fifi 


Hs 75724 


fYjatomAr nrntpin comnlex subunit beta 2 


2.2 


TM 




103194 


NM 004939 


Hs.78580 
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ESTs 
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Hs.32168 
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1.6 


TM 
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hypothetical protein M6C2963 
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? 
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2.1 


TM 
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Hs.323401 


dpy-30-like protein 


1.5 


TM 
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Hs.24048 
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6.7 


TM 
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Hs.35669 


ESTs, Moderately similar to ALU1_HUMAN A 


5.7 


other 
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Hs.6614 


ESTs, Weakly similar to A43932 mucin 2 p 


3.4 
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Hs.12727 


hypothetical protein FU21610 


1.7 


TM 
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110856 
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gb:ot37g06.s1 Soares testis NHT Homo sap 


2.3 


other 
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BE384447 


Hs. 16034 


hypothetical protein MGC13186 


3.5 


? 
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Hs.6880 
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2.2 


? 
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Hs.29724 


hypothetical protein FLJ13187 


2.6 


SS, 


110918 
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Hs.24283 
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1.9 


TM 
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signal transduction protein (SH3 contain 


6.7 
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ADP-ribosyltransferase (NAD+; poly(ADP-r 


1.3 
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1.8 


? 
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ESTs, Moderately similar to Z195 HUMAN Z 


3.7 
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2.1 


TM 
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2.3 
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6.8 
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3.6 
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2.6 


other 


111223 
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4.7 
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7.9 


? 
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7 
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5 
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rio.it/ si a 


CCT 

CO t 


6 6 


other 


ihmq 




Hs 2fTC91 


FST<5 MoripratPlv similar tn 7RF1 HUMAN 7 
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ESTs; calsyntenin-2 


2.8 


other 


112244 


AB02Q000 


Hs 70823 

1 IO. f UUiJ 


KIAA1Q77 nratein 


14.6 
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NM 014053 


Hs.270594 


124634 


AI765123 


Hs.143671 


124637 AA1 60474 


Hs.75798 


124642 


AW968856 


Hs.278569 


124649 


N92593 


Hs.313054 


124661 


R48170 


Hs.78436 



Homo sapiens cDNA: FLJ21766 fis, clone C 

ESTs 

ESTs 

ESTs 

EST 

ESTs 

ESTs 

Homo sapiens cDNA FLJ11946 fis, clone HE 

ninein (GSK3B interacting protein) 

ESTs, Weakly similar to KIAA1395 protein 

ESTs 

ESTs 

EST 

myomegalin 

ESTs, Weakly similar to I38022 hypotheti 

Homo sapiens cDNA FU12371 fis, clone MA 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to RMS1_HUMAN REGUL 

gb:zv37d10.s1 Soares ovary tumor NbHOT H 

ESTs 

ESTs 

ESTs 

EST 

Human DNA sequence from clone RP1 1-110H4 

zinc finger protein 14 (KOX 6) 

Homo sapiens, clone IMAGE:4098694, mRNA, 

ESTs 

Homo sapiens cDNA FU 14680 fis, clone NT 
KIAA1080 protein; Gokji-associated, gamm 
gb:af12a1 2.s1 Soares_testis_NHT Homo sap 
gb:no97c02.s1 NCI_CGAP_Pr2 Homo sapiens 
gb:zu71d09.s1 Soares_testis_NHT Homo sap 
kinesin protein 9 gene 
FH1/FH2 domain-containing protein 
EST 

Huntingtin interacting protein E 

EST, Weakly similar to S65657 alpha-1 C-a 

gb:ae60g05.s1 Stratagene lung carcinoma 

metastasis-associated 1-like 1 

choline dehydrogenase 

ralA binding protein 1 

ESTs 

HIV-1 rev binding protein 2 
topoisomerase-related function protein 4 
putative G protein-coupled receptor 
ESTs, Weakly similar to AF161356 1 HSPC0 
ESTs 

KIAA0265 protein 

gb:aq49a10.x1 Stanley Frontal NB pool 2 

COBW-like protein 

NY-REN-18 antigen 

gb:yv28h09.s1 Soares fetal liver spleen 

ribosomal protein L17 

Human DNA sequence from clone 989H1 1 on 
kinesin heavy chain member 2 
gb:yv59d09.s1 Soares fetal liver spleen 
ESTs 
ESTs 

ESTs, Weakly similar to neuronal thread 
FLVCR protein 

Homo sapiens cDNA FU 13533 fis, clone PL 

hypothetical protein 

sorting nexin 17 

ESTs 

EphB1 
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124683 


AA381661 


Hs. 119878 


ESTs, Weakly similar to M3K9_HUMAN MITOG 


7.9 


TM 


124712 


R09166 


Hs. 191 148 


ESTs 


5.7 


other 


124735 


R22952 


Hs.268685 


ESTs 


11.3 


? 


124761 


AA374756 


Hs.93560 


Homo sapiens mRNA for KIAA1771 protein, 


9 


other 


124768 


AW368528 


Hs.1 00855 


ESTs 


8.3 


other 


124775 


R41772 


Hs 100878 

1 IO, 1 UUUl KJ 


ESTs 


4.9 


other 


124777 


R41933 


Hs 140237 


ESTs, Weakly similar to ALU 1 HUMAN ALU S 


2.8 


other 


124788 


R43543 


Hs.100912 


Homo sapiens cDNA: FU22726 fis, clone H 


5.1 


other 


124809 


AL355722 


Hs. 106875 


Homo sapiens EST from clone 35214, full 


4.2 


other 


124811 


R46068 


Hs 288912 


hypothetical protein FLJ22604 


14.2 


other 


124812 


R47948 


Hs 188732 


ESTs 


7.9 


other 


124822 


M418160 


Hs 86043 


Homo sapiens cDNA FLJ 13558 fis, clone PL 


6.6 


other 


124825 


M501669 


Hs.336693 


ESTs 


2.3 


SS.TM 


124833 


AW9758B8 


Hs 294100 


ESTs 


2.7 


SS.TM 


124857 


R63652 


Hs 137190 

1 IO, l \JI ■ 


ESTs 


2.3 


other 


124860 


R65763 


Hs.101477 


EST 


23.9 


? 


124863 


AI382555 


Hs.127950 


bromodomain-containing 1 


2 


other 


124876 


AF1 35422 


Hs.27059 


GDP-mannose pyrophosphorylase A 


4.4 


ss, 


124878 


BE397530 


Hs.288057 


hypothetical protein FLJ22242 


2.7 


other 


124902 


H37941 


Hs.101883 


ESTs 


5.7 


other 


124903 


AW296713 


Hs.221441 


ESTs 


32.4 


other 


124930 


AI076343 


Hs 173939 

1 l«J. I f vwwv 


ESTs, Weakly similar to ALUBJHUMAN !!!! 


22.8 


other 


124942 


R99978 


Hs.268892 


ESTs, Moderately similar to B34087 hypot 


6.1 


other 


124958 


AI078645 


Hs.431 


murine leukemia viral (bmi-1) oncogene h 


1.9 


other 


124980 


T40841 


Hs 98681 


ESTs 


4.5 


? 


125002 


T59338 


Hs 269463 


ESTs Weaklv similar to ALU1 HUMAN ALU S 


4.9 


other 


125047 


T79815 


279793 


ESTs 


5 


? 


125051 


T79956 


Hs 100588 


EST 


135.3 


? 


125056 


T81310 


Hs 100592 


ESTs 


5.4 


other 


125101 


AI472068 


Hs 286236 


KIAA1856 protein 


5.6 


other 


125113 


T96595 


Hs.302270 


ESTs, Weakly similar to ALUF_HUMAN !!!! 


1.8 


other 


l£J 1 1 J 


TQ7341 






9.6 


? 


125125 


AI272382 


Hs 240767 


Human DNA ^poupncp from clonp RP1-12G14 

i IUI 1 IOI 1 Ul l/V OVVf Uvl IVV II VII 1 VlUI Iw 1 M 1 I I i 


1.5 


TM 


125147 


¥ ¥ JU 1 JU 




Fmnmfallv Rplprfpd from AFFX ^innlp nr 


1.7 


? 


12"i1fi1 

\£.<J IU 1 


¥¥*T*tUJ/ 


H<! 1442 1 ?? 


EST 


10.7 


? 


125249 




hta 131375 


ESTs Modpratplv similar tn ALUB HUMAN 1 


1.3 


other 


125255 


AFflQRIfi? 


H<! 118631 


uiijcicoo ^uiuou^iinay iiumumy 


9.4 


other 


125279 


AW401809 


Hs.4779 


KIAA1150nrotein 


1.5 


? 


125280 


AI123705 


Hs 106932 


ESTs 


8.1 


? 


125298 




Hs 98Q008 


Homo sanipns rDNA' FLJ21814 fis clone H 


1.5 


other 




AW2Q2171 


Hs 23Q78 


^pjiffnlH attarhmpnt fartnr R 

OOCIIIUIU aLLQWl 11 1 ICI IL 1 UUIUI U 


5.9 


other 


125827 


MM fifi^dOl 

III VI vv v^tU-J 


Hs 974Q6 


YY1 tran^rrinHnn farfnr 

I 1 1 U at IJVl IJJUVJI 1 laVsLVJI 


1.2 


? 


125891 


U29589 


Hs.7138 


cholinergic receptor, muscarinic 3 


6.5 


? 


1 26005 


AW409701 


Hs.1578 


baculoviral IAP repeat-containing 5 (sur 


14.3 


? 


126202 


AA1<i7fi12 


Hs 272630 


vacuolar nmtnn niiinnriplfa nnlvnpntidp 

VCtWUVJIOl I^IUIUII I^UIIIVvJCIKI pVIJf JJCJJLIVJC 


2.5 


ss, 


126695 


AA643322 


Hs 172028 

1 IO. 1 1 i. Wt-lJ 


a riteintfiflrin and mptellonroteinasp doma 


9.1 


SSJM 


127050 


AW411066 

nnTl 1 WSJ 


Hs 274351 

■ IO.*- 1 »\J\J 1 


CGI-89 protein 


17 


other 


127274 


AWQ66158 


Hs 58582 


Homo saniens cDNA FLJ 12789 fis clone NT 

■ i *ji i nj oaklet io uui in i l>u i tmt w iiwj wui iv n i 


12.8 


other 


128355 




Hs 161623 

1 JO. 1 V 1 


ESTs 


7.4 


ss, 


128493 


D87466 


Hs 240112 


KIAA0276 Drotein 


3.1 


TM 


128522 


BE173977 


Hs 10098 

1 IO. ■ vUVU 


putative nucleolar RNA helicase 


9.4 


other 


128527 




Hs 101047 

1 IO. 1 w 1 tj*f f 


transcription factor 3 (E2A immunoglobul 


1.5 


other 


128528 


R39234 


Hs.251699 


ESTs, Weakly similar to IDN4-GGTR14 [H.s 


2.8 


other 


128595 


U31875 


Hs.272499 


short-chain alcohol dehydrogenase family 


12.1 


TM 


128599 


NM 015366 


Hs.102336 


Rho GTPase activating protein 8 


2.4 


? 


128604 


AI879099 


Hs.102397 


GIOT-3 for gonadotropin inducible transc 


1.3 


other 


128608 


BE267994 


Hs.1 02419 


zinc finger protein 


7.2 


other 


128625 


AB037841 


Hs.102652 


hypothetical protein ASH1 


1.3 


other 


128629 


AL096748 


Hs.102708 


DKFZP434A043 protein 


3.2 


other 


128639 


AW582962 


Hs.102897 


CGI-47 protein 


2 


TM 


128656 


AA458542 


Hs.10326 


coatomer protein complex, subunit epsilo 


1.4 


other 


128658 


BE397354 


Hs.324830 


diptheria toxin resistance protein requi 


2.5 


other 


128670 


AA975486 


Hs.103441 


Homo sapiens, Similar to RIKEN cDNA 1700 


7.1 


? 


128691 


W27939 


Hs.103834 


hypothetical protein MGC5576 


7.8 


? 


128696 


BE081143 


Hs.225977 


nuclear receptor coactivator 3 


3.8 


other 


128700 


Y15221 


Hs.103982 


small inducible cytokine subfamily B (Cy 


1.6 


other 


128714 


T85231 


Hs.179661 


tubulin, beta 5 


7.8 


other 
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128717 


AK001564 


Hs. 104222 


hypothetical protein FLJ 10702 


5.5 


other 


128733 


BE147740 


Hs. 104558 


ESTs, Moderately similar to I38022 hypot 


2.7 


TM 


128737 


AF292100 


Hs.104613 


RP42 homolog 


2.8 


TM 


128742 


AA307211 


Hs.251531 


proteasome (prosome, macropain) subunit, 


4.5 


? 


128746 


AI470163 


Hs.323342 


actin related protein 2/3 complex, subun 


2.2 


other 


128747 


AB027249 


Hs. 104741 


PDZ-binding kinase; T-cell originated pr 


2.8 


other 


128772 


BE302796 


Hs.105097 


thymidine kinase 1, soluble 


5.4 


other 


128781 


N71826 


Hs.1 05465 


small nuclear ribonucleoprotein polypept 


53.9 


TM 


128797 


NMJ02975 


Hs.1 05927 


stem cell growth factor; lymphocyte seer 


13.3 


other 


128806 


AW630942 


Hs.1 06061 


RD RNA-binding protein 


2.6 


other 


128814 


AW248431 


Hs.256526 


nuclear prelamin A recognition factor 


2.2 


other 


128830 


BE281170 


Hs.1 06357 


valosin-containing protein 


6 


other 


128835 


AK001731 


Hs. 106390 


Homo sapiens mRNA; cDNA DKFZp586H0924 (f 


1.6 


SS 


128854 


BE159181 


Hs.1 68232 


hypothetical protein FLJ13855 


2.3 


other 


128871 


AF1 89723 


Hs.106778 


ATPase, Ca++ transporting, type 2C, memb 


1.5 


? 


128906 


R57988 


Hs. 10706 


epithelial protein lost in neoplasm beta 


4.8 


other 


128920 


AA622037 


Hs 166468 

1 IO- 1 vU ivU 


nrnnranrimpfi rpll ripath R 

j^i ^iai i iiiigu wdi ucaui •J 


1.4 


nthpr 


128925 


R67419 


Hs 21851 


Hnmn ^anipns rRNA Fl H2Q0flfi<; rlnnp NT 


1 Q 


nthpr 


128946 


Y13153 


Hs.107318 


kvnureninp 1-monooxvapna^p (kvnurpninp 3 

rvji iui in ic. *j I II I WAjf vj^i icuc 1*^/1 IUI Gl III IG %J 


7.3 


? 


128949 


M009647 


Hs 8850 


a rfisinfearin and mptallonmfpina^p rinma 


2.5 


other 


128959 


AI580127 


Hs.107381 


hypothetical protein FLJ11200 


1.3 


other 


128965 


AW1 50697 


Hs.107418 


ESTs 


1.4 


? 


128970 


AI375672 


Hs 165028 


ESTs 


1.3 


other 


128975 


BE560779 


Hs.284233 


NICF-fi nrntpin 


14 


other 


128979 


AW271217 


Hs.281434 


Homo saniens cDNA FLJ14028 fis clone HE 

1 IVI 1 IV OupiwIlO 1 1/ \ ( Uv nvLU IIO| V/IUI Iw 1 1^ 


1.6 


TM 






H<! 107747 


nKF7P5fifiP94? nrntpin 


1 Q 


other 


129019 


A1950087 




nh*wn0^rfl9 y1 MHI P.fiAP Kin*19 Hnmn e.anipn 
y u. vhu i i/uuu£.a i iivi wj/^i __r\iu 14. numu oauicii 


9 Q 


nthpr 


129021 


AL044B7') 


H<! 171081 

ri3i l f Ow I 


KIAAfTttf) nrntpin 


o.o 


nth or 

UU ICI 


129032 


R80088 


Hs.108104 


ubinuitin-coniuaatina pn7vmp E9L *■$ 


3.4 


other 


129076 


AW296806 


Hs.326234 


FST^ Hinhlv similar +n T4fi4?? hvnnthpti 
lysosomal 

1 jwuwui iiai 


5 


other 


129078 


AI351010 


Hs 102267 


2.1 


other 


129088 


AA744610 


Hs 194431 

1 IO. 1 ^"TTv 1 


nallariin 


17.1 


other 


129095 


L12350 


Hs 108623 

i lo. i uuutj 


U H\Jl I tUUOJJUl lull 1 t. 


2.7 


nthpr 

UU ICI 


129096 


AA463189 


Hs 288906 


WW Domain-Containina Gpnp 


20.9 


TM 


129097 


RFWm 

L)f_£.*TOC7>JO 


no. tuoutt 


III ly CI jJIULCIII C.L. \\\\Jf\ lyjj 




other 


129099 


AF146074 


1 lo. 1 UUUUV 


r\ 1 r~UmUllly UClOOCllC, oUU ~l all 111* \s |ui 1 Ix 


>J.O 


TM 

1 IVI 


129136 


W93048 


Hs 250723 




ft 


nth or 


129149 


AA356620 


Hs 108947 


KIAAOn'SO npnp nrnrinrt 




TM 

1 IVI 


129172 


AW1 62916 


Hs.241576 


hvnothpfiral nrntpin PR09S77 


1 R 
1 .o 


TM 

1 IVI 


129192 


AA78RQ14 


H<5 1R32QQ 


F^Te. 

CO I o 


9 1 


9 


129194 


AA1 5070.7 

nn I Jul 3 1 


NO. 1 U9^/ U 


laLCAlll [JIULclll 




TM 

OO, 1 IVI 


129198 


NS7TO 

1 iJf UU£ 


no. lujj u 


KIAA141S nrntpin 




r»thor 


129207 


AI934365 


Hs 109439 

1 IO. 1 UJlJJ 


neipfvi 1 vrjn /n^tpninHiirtix/P fartnr mimp 
uoucuyiyui i ^doicuiiujuuiivc icjului, iiiiiiib 


8.1 


nth pr 

UU ICI 


129228 


U40714 


H<! 210.107 




9 Q 


other 


129229 


/\I V Is/1 \JO 


Hs 10QR4? 
no. i usutu 


nnlvsHpnuIatp hinriinn nmtpin-in+oraptin 


0.0 


9 


1 


AA9594fiR 


He moa 

rid. iu9u 


nKF7nA' : lll1fln nrntpin 




QQ TM 


1 £0£ J J 


AIQR1797 


He. mQflfli 
no. iuoquh 


n 1 llloLUMc laillliy, lllcUIUci A 


7 A 


other 




W9fiV»9 


Lie iinnfln 

no. I IUUOU 


co i s, vveaKiy similar 10 o i o*»yo pregnancy 




other 


I £o£oO 






CO 1 s 


1 .£. 


other 




AA9879?Q 


He «lfl 


Unmn canionc pHWA Fl 11 1^1 1 flc rlAPio Dl 
nUIIIU oapicllo OUtNM. rLJ i IO 1 1 Ho, UJUlio rL 


^ 9 


other 




n/ «juo*t 


no. i iuju 


F_Kr\y rtnlu nrrttoin Q 


4 7 


oo, 


129347 


BE614192 


U|<5 97QftfiQ 
rw.t/ sou? 


iitcidiiunid-a&cUbiaLcU aiiuytfli icuuyniocu u 


7 7 


T^^ 

1 IVI 


129362 


U30246 


Hs.1 10736 


solute carrier family 12 (sodium/potassi 


6.7 


TM 


129366 


BE220806 


Hs.184697 


Homo sapiens clone 23785 mRNA sequence 


8.6 


SS, 


129370 


AI686379 


Hs.110796 


SAR1 protein 


1.4 


TM 


129372 


NM 016039 


Hs.1 10803 


CGl-99 protein 


2 


other 


129403 


AF149785 


Hs.111126 


pituitary tumor-transforming 1 interacti 


7.5 


other 


129404 


AI267700 


Hs.317584 


ESTs 


5.1 


other 


129423 


AA204686 


Hs.234149 


hypothetical protein FU20647 


10.2 


other 


129482 


AA188185 


Hs.289043 


spindlin 


6.8 


other 


129513 


AW843633 


Hs.306163 


hypothetical protein AL110115 


7.1 


SS, 


129515 


AF255303 


Hs.112227 


membrane-associated nucleic acid binding 


2.5 


other 


129527 


AA769221 


Hs.270847 


delta-tubulin 


3.2 


other 


129559 


W01296 


Hs.1 1360 


hypothetical protein FLJ 14784 


7.5 


other 


129560 


AA317841 


Hs.7845 


hypothetical protein MGC2752 


6.8 


other 


129570 AI923097 


Hs.1 1441 


chromosome 1 open reading frame 8 


2.1 


other 


129575 


F08282 


Hs.278428 


progestin induced protein 


1.6 


other 
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129587 


H14718 


Hs.11506 


Human clone 23589 mRNA sequence 


6.8 


other 


129588 


BE408300 


Hs.301862 


postmeiotic segregation increased 2-like 


1.4 


TM 


129591 


N57423 


Hs.1 79898 


HSPC055 protein 


7.4 


other 


129594 


AW403724 


Hs.36989 


coagulation factor VII (senjm prothrombi 


9 


? 


129596 


AF035537 


Hs.1 15521 


REV3 (yeast homoiog)-like, catalytic sub 


1.6 


other 


129628 


U38945 


Hs.1174 


cyclin-dependent kinase inhibitor 2A (me 


2.2 


other 


129649 


AD000092 


Hs.1 6488 


caireticulin 


3.3 


other 


129675 


NM_015556 


Hs.172180 


KIAA0440 protein 


13.4 


other 


129680 


U03749 




gb:Human chromogranin A (CHGA) gene, pro 


14.1 


? 


129689 


AW748482 

nil 1 TVTVt 


Hs 77873 


B7 homolog 3 


2.6 


other 


129702 


AI304966 


Hs.12035 


ESTs, Weakly similar to I38022 hypotheti 


7.5 


TM 


129720 


AA156214 


Hs.12152 


APMCF1 protein 


2 


other 


129721 


NM_001415 


Hs.211539 


eukaryotic translation initiation factor 


1.7 


TM 


129778 


AK001676 


Hs.1 2457 


hypothetical protein FLJ10814 


1.8 


other 


129779 


AA394090 


Hs.12460 


Homo sapiens clone 23870 mRNA sequence 


5.5 


TM 


129800 


AF052112 


Hs 12540 

t lO. lt\<TU 


lysosomal 


1.7 


? 


129806 


AB023148 


Hs. 173373 


KIAA0931 protein 


1.2 


other 


129815 


RE5fi5817 


Hs 26498 


hypothetical protein FLJ21 657 


3.1 


other 


129840 


NM 006590 


Hs. 12820 


SnRNP assembly defective 1 homolog 


1.8 


other 


129861 


AL049999 


Hs 85963 


DKFZP564M182 protein 


2.3 


other 


129864 


AI393937 


Hs 129914 


nint-rplatfri tranirrintion fertor 1 tern 


1.7 


SS, 




A122706Q 


Hs 13015 


hypothetical protein similar to mouse Dn 


2.8 


TM 




BF514376 


Hs 165998 

1 13. IvWO^U 


PAI-1 mRNA-hintlinn nrofpin 

i m I I mi \| ^.n uii lull ly l/ivk^iii 


1.8 


other 




AA4121Q 1 ; 


Hs 13740 


ESTs 


2.5 


other 


12QQ72 

1 £03/ L 


r\i»/ Jd IOJ 


Hs 180628 
no. louu^o 


Hvnamin 1-likp 
ujri loi i UN i iir\c 


1.8 


? 




1 IflQRilfl 


Uc 1?2?qn 
no. iozosu 


£.]] \K/ ill ly CI LHUlCll I OO ^[\WA IO^ 


1.3 


other 


I OUU I u 


MrtOU 1 1 ID 


no. ItZOOO 


llUulcUlai JJliUopilUL'lUlCliI INUppOH 


1 fi 


nthpr 


I OuUO I 


AA9ft7**95 


no. \"tl IO 


CO 1 o 


4.1 


other 


1 OUUO£ 


Of OZ.\JO 


H«s 1473 
no. it/ o 


nnc.trin-i"£ilp;iQ.inn npntiHp 
yciouui ictcaoiiiy pc^jnuc 


1.9 


other 


•nnno.7 

1 JUUDV 


At 04fiQfi9 


H<s 14845 
no. itoto 


fnrkhparf hn* O^A 


2.8 


other 






Hs 148Q1 
no. i*too i 




2.3 


other 


130111 




Hs 149846 

i to. ifswTU 


intporin bpta 5 


2.3 


other 


13011? 

1 OU 1 l£ 


AAQ1fi7fi ( i 


no. iouu iu 


cnlirinn fartnmrolinp/nlntAminp rirh ( 

ofJiiuniy lauiui (Ji uni ic/y luidi i imc iiui \ 


3 


other 


130128 


L76937 


Hs 150477 


Wemer syndroms 


1.8 


other 


13(1115 


AA311426 


Hs 21635 


tubulin, Qsmmd 1 


6.1 


other 


130211 


NM 003358 


Hs.23703 


ESTs, Moderately similar to CEGT_HUMAN C 


1.6 


other 


130212 


nsonm 


Hs 152629 


K1AA0179 nrotein 


1.3 


other 




rso vjkjui 


Hs 51957 


spliclnp, factor, arQinine/serine-rich 2, 


2 


other 


130241 


Al 035588 


Hs 153203 


MyoD fsmily inhibitor 


3.2 


other 


130242 


X79201 


Hs 153221 

1 IO. 1 Oubt 1 
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131511 M732153 Hs.27865 Homo sapiens cDNA: FU21333 fls, clone C 
131528 AU076408 Hs.28309 UDP-glucose dehydrogenase 
131532 BE268278 Hs.28393 hypothetical protein MGC2592 

131543 AW966881 Hs.41639 programmed cell death 2 

131544 AL355715 Hs.28555 programmed cell death 9 (PDCD9) 
131562 NM.003512 Hs.28777 H2A histone family, member L 

131564 T93500 Hs.28792 Homo sapiens cDNA FU11041 fis, clone PL 
131569 AL389951 Hs.271623 nucleoporin 50kD 

131618 BE393822 Hs.29645 Homo sapiens mRNA;cDNADKFZp761C029(fr 

131622 R78195 Hs.29692 Homo sapiens cDNAFLJ1 1436 fis, clone HE 

131623 AB037791 Hs.29716 hypothetical protein FLJ10980 
131643 AW410601 Hs.30026 HSPC182 protein 

131653 AW960597 Hs.30164 ESTs 

131656 AI218918 Hs.30209 KIAA0854 protein 

131669 X52486 Hs.3041 uracil-DNA glycosylase 2 

131692 BE559681 Hs.30736 KIAA01 24 protein 

131714 AA642831 Hs.31016 putative DNA binding protein 

131722 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransf 

131737 AK001641 Hs.31323 inhibitor of kappa light polypeptide gen 

131763 AI878932 Hs.317 topoisomerase (DNA) I 

131772 AA382590 Hs.170980 KIM0948 protein 

131787 D87077 Hs.196275 KIM0240 protein 

131793 AW966127 Hs.32246 Homo sapiens cDNA FLJ14656 fis, clone NT 

131795 BE501849 Hs.32317 high-mobility group 20B 

131798 X86098 Hs.301449 adenovirus 5 E1A binding protein 

131817 U20536 Hs.3280 caspase 6, apoptosis-related cysteine pr 

131824 U28838 Hs.32935 TATA box binding protein (TBP)-associate 

131850 AI251317 Hs.33184 ESTs 

131878 M083764 Hs.6101 hypothetical protein MGC3178 

131885 BE502341 Hs.3402 ESTs 

131900 M099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA.com 

131904 AF078866 Hs.284296 Homo sapiens cDNA: FU22993 fis, clone K 

131905 AA179298 Hs.3439 stomatin-like 2 

131913 AW207440 Hs.185973 degenerative spermatocyte (homolog Droso 

131916 AA025976 Hs.34569 ESTs 

131925 AF151048 Hs.183180 anaphase promoting complex subunit 1 1 (y 

131929 BE541211 Hs.34804 Homo sapiens CDNAFLJ1 1472 fis, clone HE 

131941 BE252983 Hs.35086 ubiquitin specific protease 1 

131950 AA355113 Hs.35380 x 001 protein 

131962 AK000046 Hs.267448 hypothetical protein FLJ20039 

131965 W79283 Hs.35962 ESTs 

131971 BE567100 Hs. 154938 hypothetical protein MDS025 

131977 U90441 Hs.3622 procollagen-proline, 2-oxoglutarate 4-di 

131985 AA503020 Hs.36563 hypothetical protein FLJ22418 

131991 AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 

132019 H56995 Hs.37372 Homo sapiens DNA binding peptide mRNA, p 

132062 BE266155 Hs.3832 clathrin-associated protein AP47 

132084 NM_002267 Hs.3886 karyopherin alpha 3 (importin alpha 4) 

132103 BE171921 Hs.3991 ESTs 

132105 AV646076 Hs.39959 ESTs 

132116 AW960474 Hs.40289 ESTs 

132176 AA857025 Hs.8878 kinesin-like 1 

132180 NM_004460 Hs.418 fibroblast activation protein, alpha 

132194 R42432 Hs.4212 ESTs 

132207 BE206939 Hs.42287 E2F transcription factor 6 

132235 AV658411 Hs.42656 KIM1681 protein 

132252 AI566004 Hs.141269 Homo sapiens cDNA: FLJ21550 fis, clone C 

132266 AA301228 Hs.43299 hypothetical protein FLJ 12890 

132273 AA227710 Hs.43658 DKFZP586L151 protein 

132276 M653507 Hs.285711 hypothetical protein FLJ 13089 

132288 N36110 Hs.305971 solute earner family 2 (facilitated glu 

132294 AB023191 Hs.44131 KIM0974 protein 

132298 NM_015986 Hs.7120 cytokine receptor-like molecule 9 

132299 AW405882 Hs.44205 cortistatin 

132325 N37065 Hs.44856 hypothetical protein FLJ12116 

132370 AW572805 Hs.46645 ESTs 

132374 AF155582 Hs.46744 corel UDP-galactose:N-acetylgalactosamin 
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132376 AI279892 Hs.46801 
132384 M312135 Hs.46967 
132393 AL135094 Hs.47334 
132450 M100012 Hs.48827 
132452 AW973521 Hs.247324 
132456 AB011084 Hs.48924 
132470 AI224456 Hs.4934 
132484 X16660 Hs.1 19007 
132518 AW885606 Hs.5064 
132530 AA306105 Hs.50785 
132532 AA454132 Hs.5080 
132534 BE388673 Hs.5086 
132543 BE568452 Hs.5101 
132574 AW631437 Hs.5184 
132596 AK001484 Hs.5298 

132611 AA345547 Hs.53263 

132612 H12751 Hs.5327 
132616 BE262677 Hs.283558 
132638 AI796870 Hs.54277 
132668 AB018319 Hs.5460 
132692 AW191962 Hs.249239 
132715 F11875 Hs.5534 
132718 NM_004600 Hs.554 
132724 AI142265 Hs.55498 
132731 AI189075 Hs.301872 
132744 AA010233 Hs.55921 
132760 M1 25985 Hs.56145 
132771 Y10275 Hs.56407 
132773 AA459713 Hs.295901 
132784 AI142133 Hs.56845 
132798 AI026701 Hs.5716 
132807 U07418 Hs.57301 
132810 AB007944 Hs.5737 
132813 BE313625 Hs.57435 
132815 AI815189 Hs.57475 
132817 N27852 Hs.57553 
132821 AJ251595 Hs.169610 
132833 U78525 Hs.57783 
132842 NM_016154 Hs.279771 
132844 F12200 Hs.5811 
132851 U09716 Hs.287912 
132869 AW963217 Hs.203961 
132873 AW007683 Hs.58598 
132875 NM_004850 Hs.58617 
132891 BE267143 Hs.59271 
132897 AW503667 Hs.59545 
132902 AI936442 Hs.59838 

132912 AW732760 Hs.167578 

132913 W78714 Hs.60257 
132940 T79136 Hs.127243 
132942 AA554458 Hs.1 97751 
132952 AI658580 Hs.61426 
132962 AA576635 Hs.6153 

132972 AA034365 Hs.288924 

132973 M035446 Hs.323277 
132977 AA093322 Hs.301404 
132980 AA040696 Hs.62016 
132994 AA1 12748 Hs.279905 
133012 AA847843 Hs.62711 
133015 AJ002744 Hs.246315 
133062 AW500374 Hs.64056 
133069 BE247441 Hs.6430 
133091 AK001628 Hs.64691 
133110 AA808177 Hs.65228 
133134 AF198620 Hs.65648 
133145 H94227 Hs.6592 
133152 Z11695 Hs.324473 



sorting nexin 14 
HSPC034 protein 
hypothetical protein FU14495 
hypothetical protein FLJ12085 
mitochondrial ribosomal protein S14 
KIM0512 gene product; ALEX2 
H.sapiens polyA site DNA 
RAB4, member RAS oncogene family 
ESTs 

SEC22, vesicle trafficking protein (S. c 
mitochondrial ribosomal protein L16 
hypothetical protein MGC10433 
protein regulator of cytokinesis 1 
TH1 drosophila homolog 
CG1-45 protein 

hypothetical protein FLJ13287 

PR01914 protein 

hypothetical protein PR01855 

DNA segment on chromosome X (unique) 992 

KIAA0776 protein 

collagen, type VIII, alpha 2 

Homo sapiens cDNAFLJ 12961 fis, clone NT 

Sjogren syndrome antigen A2 (60kD, ribon 
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tousled-like kinase 2 

CD44 antigen (homing function and Indian 

eukaryotic translation initiation factor 
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NM_004893 Hs.75258 

NM_002047 Hs.75280 

NM_000401 Hs.75334 



U25849 
AV661185 
L27841 
AW969976 



Hs.75393 
Hs.75574 
Hs.75737 
Hs.279009 



AW402048.comp 
T52946 Hs.196209 



BE271766 

M62194 

M557660 

AL133921 

D50525 



Hs.181357 
Hs.75929 
Hs.76152 
Hs.76272 
Hs.699 



AW797468 Hs.285013 
AA147026 Hs.76704 



AB011155 
AW340125 
AB012193 
U30825 
D86326 



Hs.170290 

Hs.76989 

Hs.183874 

Hs.77608 

Hs.325948 



NM_006306 Hs.211602 
L17128 Hs.77719 



BE244332 
X81789 
AI908165 
AL040328 
AI824113 
AB016092 
D31764 



Hs.77770 

Hs.77897 

Hs.169946 

Hs.78202 

Hs.78281 

Hs.197114 

Hs.278569 



NM_003590 Hs.78946 

U41060 Hs.79136 

NM_014742 Hs.79305 

H86504 Hs.173328 

BE559598 Hs.197803 



hypothetical protein MGC2745 2.7 
ESTs, Weakly similarto S19560 proline-r 9,3 
RAD54 (S.cerevisiae)-like 4.5 
ESTs 5.5 
Homo sapiens, Similar to bromodomain-con 2.7 
gokji phosphoprotein 1 1 j 

Homo sapiens, done IMAGE:3544662, mRNA, 1 .3 
ESTs, Weakly similar to FXD2JHUMAN FORKH 1 2.2 
NRAS-related gene 1.2 
gamma-aminobutyric acid (GABA) A recepto 1 .7 
integral type I protein 11.1 
Kruppel-like factor 13 2.9 
acid cluster protein 33 2.5 
v-akt murine thymoma viral oncogene homo 1 .5 
non-metastatic cells 5, protein expresse 2. 1 

KIAA0539 gene product 1.3 
DKFZP564I1922 protein 2.2 
phosphatidylserine receptor 5.7 
inhibin, beta A (activin A, activin AB a 25.5 
hypothetical protein HT023 3.3 
phosphotidylinositol transfer protein, b 1 .6 

protein kinase, interferon-inducible dou 4.1 
protein tyrosine phosphatase, non-recept 1 .5 

hypothetical protein FLJ22237 6.3 
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other 


2 


TM 


7.6 


other 


2.2 


other 


2.9 


other 


1.8 


other 


5.9 


other 


5.7 


other 


1.4 


other 
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10 



15 



220 



\5 



130 

5 : 



35 



40 



45 



50 



55 



60 



65 



452461 


N78223 


Hs.108106 


transcription factor 


4.8 


? 


452511 


BE408178 


Hs.285165 


Homo sapiens cDNA FLJ20845 fis, clone AD 


2.9 


other 


453157 


AF077036 


Hs.31989 


DKFZP586G1 722 protein 


12.1 


SS.TM 


453658 


BE541906 


Hs.87819 


Homo sapiens, clone MGC:2492, mRNA, comp 


4.8 


other 


100685 


AA328229 


Hs. 184582 


ribosomal protein L24 


1.8 


TM 


100690 


AA383256 


Hs.1657 


estrogen receptor 1 


1.6 


other 


100833 


AF135168 


Hs. 108802 


N-ethylmaleimide-sensitive factor 


1.3 


other 


100850 


AA836472 


Hs.297939 


cathepsin B 


1.7 


? 


101161 


NM 006262 


Hs.37044 


peripherin 


16.9 


other 


102481 


U50360 




gb:Human calcium, calmodulin-dependent p 


3.2 


UU ICI 


102831 


AA262170 


Hs.80917 


adaptor-related protein complex 3, sigma 


2 


? 


103549 


BE270465 


Hs.78793 


protein kinase C, zeta 


3 


nthpr 

UU ICI 


103749 


AL1 35301 


Hs.8768 


hypothetical protein FLJ 10849 


1 R 

l-O 


ULl K\ 


104331 


AB040450 


Hs.279862 


cdk inhibitor o21 hindinn nrntpin 

tAJIV II II IIUIll/l 1 LMiUlliy ^IIULCIII 


9 


9 


104532 AI498763 


Hs.203013 


hypothetical protein FLJ12748 


9 1 




104563 


AL1 17403 


Hs.306189 


DKFZP434F1735 protein 


1.2 


nthpr 


105032 


AA127818 




oh'zl12a02^1 Snarps nrennant nfpniQ NhH 


7 
/ 


9 


105039 


AA907305 


Hs.36475 


ESTs 


2.6 


? 


106531 


M454036 


Hs.8832 


ESTs 


1 fi 


UU ICI 


106977 


AL043152 


Hs.50421 


KIAA0203 gene product 


4.9 


other 


107298 


N95657 


Hs.6820 


ESTs, Moderately similar to YOJ1 CAEEL H 


2.5 


TM 


108717 


M1 22393 


Hs.70811 


hypothetical protein FLJ20516 


1.3 


other 


110018 


AW579842 


Hs.104557 


hypothetical protein FLJ 10697 


5.3 


TM 


110330 


AI288666 


Hs.16621 


DKFZP4341 116 protein 


6.3 


other 


111391 


NM 003896 


Hs.225939 


sialyltransferase 9 (CMP-NeuAcilactosylc 


5.1 


SS 


111392 


W46342 


Hs.325081 


Homo sapiens, clone IMAGE:3659680, mRNA, 


8.4 


other 


113554 


AW503990 


Hs 142442 


HP1-BP74 


1 7 


I IVI 


113722 


AV653556 


Hs 184411 

i to. i im i i 


alhi imin 

QIUU1 IIJII 


1 ? 


other 


115008 


AK001827 


Hs 87889 




i. 


other 


115062 AA253314 


Hs 154103 

1 IO. 1 \\J\i 


I IM nrntpin /similar tn rat nrntpin Irina 

UIIVI ^lULCIII ^OlIIIIICIl LU ICtlplULCJII IMHCI 




other 


115121 


AI634549 


Hs.88155 


ESTs 


?8 


nthpr 

, UUICI 


117881 


AF161470 


Hs.260622 


butvratG-induf^ri tran^nint 1 


k a 


TM 

1 IVI 


119075 


M10905 


Hs.287820 


fibronoctin 1 




nth or 

UU ICI 


119615 


AL034423 


Hs.75875 


ubiqui tin-conjugating enzyme E2 variant 


1.3 


other 


120253 


AA131376 


Hs.326401 


fibroblast arowth fartnr 19R 




nthpr 

UU ICI 


125006 


BE065136 


Hs 145696 






? 


127609 


X80031 


Hs.530 


nnllanpn tvnp \\J alnha *3 ff^nnrlnactiiro 


1 .0 


other 


128868 


AA419008 


Hs 106730 


fhrnmrtinmp 99 nnon reaHinn framo ^ 
OI1IU1I IUOU1IIC £.£. uuci i itsauiiiy MOlUC o 


0 


other 


128891 


F34856 


Hs.292457 


Homo ^anipni Hnnp Mfnn"1fi*^fi9 mRMA mm 


IO.O 


other 


128959 


AI580127 


Hs 107381 


hvnothpfiral nmtpin Fl 11190,0. 


in q 


other 


129209 


R62676 


Hs.17820 


Rhn-a^snriatprl milpri-nnil mntaininn n 

1 \l IU aOOUOialCU, ^AJIICU^/Ull UUI llall III iu, JJ 




other 


129449 


AI096988 


Hs.1 11554 


AnP-rihn^ulfltifin fartnr-lilfP 7 
flur i luuoyiauuii icii/iui~iiM5 / 


o.z 


TM 

1 IVI 


129453 


AW974265 


Hs.1 11632 


Lsm3 protein 


3.3 


? 


129629 


AK000398 


Hs. 11747 


hvnnthptiral nrntoin PI I90QQ1 


^ Q 

o.y 


other 


129917 


M30773 


Hs.278540 


nrntpin nhn^nhataQO "X irnmnorlu 9R\ ran 
(Jiuicm jji hjojji laiaoc J ^lUIIIICIiy £-01, 'try 


o.o 


Tim 

1 IVI 


129922 


AF042379 


Hs 13386 


namma.fi ihnlin nnmnlPY nrntpin 9 




other 


129989 


AB015856 


Hs.247433 


activating transcription factor 6 


4 




130182 


BE267033 


Hs 192853 

1 io. I JtWJJ 


ii him litin-pnniimatinn or>7\/mo F9f5 9 fhnmn 
uuiijuiui i"wi ijuyctw ly dixy me uo c ijiuiiiu 




other 


130365 


W56119 


Hs 155103 


Piikarvottf*tran^latinn initiatinn fartnr 


11 

1 1 


other 


130471 


AL121438 


Hs 183706 


aririurin 1 /alnha\ 

auvJuijiii i tciiL#i id/ 


97 


other 


130542 


U64675 


Hs.1 79825 


RAN hindinn nrntpin 9-likp 1 


7 Q 


other 


130586 


AB007891 


Hs 16349 




0.0 


TM 
1 IVI 


130768 


AF258627 


Hs.211562 


ATP-binding cassette, sub-family A (ABC1 


5.2 


other 


130992 


BE398091 


Hs.74316 


desmoplakin (DPI, DPII) 


1.8 


TM 


131047 


H23230 


Hs.22481 


ESTs, Moderately similar to A46010 X-lin 


1.7 


? 


131135 


NM 016569 


Hs.267182 


TBX3-iso protein 


3.3 


TM 


131339 


AF058696 


Hs.25812 


Nijmegen breakage syndrome 1 (nibrin) 


2.6 


other 


131760 


X76732 


Hs.3164 


nucleobindin 2 


2.9 


TM 


131774 


BE267158 


Hs.169474 


DKFZP586J01 19 protein 


5.6 


other 


131853 


AI681917 


Hs.3321 


ESTs, Highly similar to !RX1_HUMAN IROQU 


1.3 


other 


131881 


AW361018 


Hs.3383 


upstream regulatory element binding prot 


3.2 


TM 


131887 


W17064 


Hs.332848 


SWI/SNF related, matrix associated, acti 


3.2 


other 


132031 


AF193844 


Hs.3758 


COP9 complex subunit 7a 


5.9 


? 


132192 


AA206153 


Hs.4209 


mitochondrial ribosomal protein L37 


2.2 


TM 


132203 NM_004782 


Hs.194714 


synaptosomal-associated protein, 29kD 


7.9 


? 


132240 


AB018324 


Hs.42676 


KIAA0781 protein 


4.3 


other 


132348 AW067708 


Hs.170311 


heterogeneous nuclear ribonucleoprotein 


12.5 


other 
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132528 


T78736 


Hs.50758 


SMC4 (structural maintenance of chromoso 


7.4 


? 




132571 


AW674699 


Hs.5169 


suppressor of G2 allele of SKP1, S. cere 


6.9 


other 




132726 


N52298 


Hs.55608 


hypothetical protein MGC955 


14.3 


? 




132863 


BE268048 


Hs.236494 


RAB10, member RAS oncogene family 


10.3 


other 




133016 


AI439688 


Hs.6289 


hypothetical protein FLJ20886 


4.4 


other 




133053 


AI065016 


Hs.6390 


Homo sapiens clone FLB3344 PRO0845 mRNA, 


1.8 


SSJM 




133197 


AI275243 


Hs.180201 


hypothetical protein FLJ20671 


1.8 


other 




133240 


AK001489 


Hs.242894 


ADP-ribosylation factor-like 1 


1.8 


other 




133266 


AI160873 


Hs.69233 


zinc finger protein 


16.1 


other 




133285 


M76477 


Hs.289082 


GM2 aanaliosirip activator nrotein 

V_IIVI£. kjailVJIIU*}lUE» OVUVCIUI ^IIUL^III 


10.4 


SS, 




133383 


BE313555 


Hs.7252 


KIAA1224 Drotein 


1.5 


? 




133540 


AL037159 


Hs.74619 


proteasome (prosome, macropain) 26S subu 


1.7 


other 




133784 


BE622743 


Hs.301064 


arfaptin 1 


12.1 


other 




133791 


M34338 


Hs.76244 


OJJCI 1 1 HUH IC OJf 1 1 U lOOC 


Q 7 


UU ICI 




133850 


W29092 


Hs 7678 


cpllnlar rptinnin arifi-hinriinn nrntpin 1 


4 9 


oo, 




133859 


U86782 


Hs 178761 


9fiS nrntpacnmp-fl^nrtflfpH np,H1 hnmnlnn 
tuo piuicaoui i ic doouoidicu uau i iiuniuiuu, 


9 9 


nfhpr 

UU Id 




133881 


U 30872 


H<! 77204 


rpntrnmprp nrntpin F (^nMfirilrn mitncin 

OC1 III Ull ICI C piULCIII r JWIUUMJ, lllllUOlll 


Q 1 


UU ICI 




134208 


NM 000288 


U<5 7QQQT. 


nprnywnmfl! hinnpnpcrc fnrtnr7 

UGI UAI3UI 1 IQI UlVjyOllGolo lalslUl f 


•? 9 

0.£ 


nfhpr 




134403 


AA334551 


Hs 82767 


^nprm •snpHfip sntinpn 9 


1.4 


nthpr 




134724 




h<5 ?9i«;7fi 

no.ot t j/ u 


rinn finnpr nrntpin 90 
imy iiiiyci uiuicm ll 


1 A 


other 




1 OtUUU 




H« RQ71R 


cnprminp cunfhscp 
ouei mine o/l lUlaoc 




other 




1 ?48"i<) 




He qmi*; 


KIAAfifYY7 nrntpin 


10.0 


other 




lOO ISO 




nb.yu I uo 


TATA tv^v hinHinn rsr/vtpin /TBD\ aeeftwata 

i m i m uoa uinuing protein \ i Dry-assoctaie 


o. 1 


other 










F<5Tc 

CO I o 


1 fi 

1 .o 


t 






T70541 




ESTs 


9 "5 


oo, 






X^776fi 




Human stromelysin-3 mRNA 


4.5 


UU ICI 










Homo sapiens clone 23592 mRNA sequence 


3.1 


nthpr 

UUICI 






AA4 t 5?4?tt 




ESTs 


4.6 


1 IVI 






R63925 




ESTs 


1.4 


other 






AA17?417 




ESTs 


1.9 


other 






AA9Rf]'iRR 




ESTs 


2.2 


other 






AA'ir)499'? 




ESTs Highly similar to CHROMOSOME 


2.4 


nthpr 










ESTs Highly similar to Surf-4 protein [M.musculus] 


5.5 


? 






F02907 




ESTs 


2.3 


TM 






AA480103 




ESTs Weakly similar to !!!! ALU SUBFAMILY J 


2.8 


TM 






AA024664 




Human NADH:ubiquinone oxidoreductase subunit 


6.2 


other 






AA251776 




ESTs 


2.3 


othsr 






AA399047 




ESTs 


2.4 


other 






li JtU>J!7 




EST-RC_N34059 


3.3 


other 






IWi?fi7 

\j0\JtJ\Jl 




Human GABA-A receptor pi subunit mRNA complete cds 


1 7 
1./ 


TM 

I IVI 




AA4Q0RQQ 




ESTs 


3.3 


other 






twr? 




ESTs 


2.9 








741 QR? 




Homo sapiens HP protein (HP) mRNA complete cds 1 .3 








AA591 1ftfi 
nn3i 1 1 00 




ESTs 


1.6 


TM 
1 IVI 










ESTs 


1.3 


other 










VITAMIN K-DEPENDENT GAMMA-CARBOXYLASE 


z.o 


other 








Homo sapiens mRNA for GDP dissociation inhibitor beta 


•1 A 
1.0 


TM 
1 IVI 




W85712 




ESTs Weakly similar to PROCOLLAGEN ALPHA 2(IV 




TM 

1 IVI 




\A/4R79ft 




ESTs Highly similar to HETEROGENEOUS 


3.7 


other 






I Ifi19^9 

UD IZOi 




Human tubulin-folding cofactor E mRNA complete cds 


£..1 


other 




AA49^1*v4 




ESTs 


5.3 


other 






TW7R 




ESTs Weakly similar to ZK1058.4 [C.elegans] 


2.6 


oo, 1 IVI 






AA496000 




ESTs 


1.9 


SS, 






W38150 




EST-RC W38150 


1.7 


?' 






T96595 




EST-RC T96595 


1.8 


TM 






AA227463 




ESTs Weakly similar to No definition line found [C.elegans] 


1.9 


? 




R46025 




ESTs 


2.8 


SS, 






AA233177 




ESTs 


2 


other 






AA338760 




ESTs 


1.3 


? 






AA412106 




ESTs 


6.2 


other 






L47276 




EST-L47276 


3.4 


other 






D82307 




ESTs Weakly similar to TH1 protein [D.melanogaster] 


11.4 


other 




AA293568 




ESTs 


1.5 


other 






R37778 




ESTs 


2.4 


other 






AA250843 




Interferon regulatory factor 5 


14.6 


? 






W49521 




Human prolyl 4-hydroxylase alpha (II) subunit 


6.5 


? 






D80000 




Human mRNA for KIAA0178 gene partial cds 


2 


other 
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R99978 

M1 95036 

Z38501 

U37547 

AA479961 

X57579 

AA449071 

N51855 

AA421213 

AA355201 

N78717 

N73808 

U86782 

AA234817 

D13666 

AA236177 

U50648 

M28211 

M446949 

W03007 

W61011 

W87544 

X02751 

Z14077 

Z38839 

AA410894 

AA504499 



ESTs Weakly similar to line-1 protein ORF2 [H.sapiens] 6.1 
Human Ro/SSA ribonucleoprotein homolog (RoRet 5.3 ? 
ESTs Weakly similar to PROBABLE E5 ' 1.4 other 
Human IAP homolog B (MIHB) mRNA complete cds 3.2 other 
ESTs 1.7 other 

Inhibin beta A (activin A activin AB alpha polypeptide) 15.8 

ESTs 1.3 TM 

ESTs Moderately similar to NAD(+) ADP- 1 .3 other 

ESTs Weakly similar to F28F8.3 [C.elegans] 3.2 other 

ESTs 1.2 SS.TM 

H.sapiens mRNA for translin 1.5 ? 

ESTs 5 ? 

Human 26S proteasome-associated pad1 2.2 other 

ESTs 1.3 other 

Homo sapiens mRNA for osteoblast specific 7.5 SS, 
ESTs 7.1 ? 
Protein kinase interferon-inducible double 4.1 ? 
Homo sapiens GTP-binding protein (RAB4) 2.9 other 
ESTs 2.2 other 
ESTs 1.2 other 
ESTs 1.2 other 
ESTs 1.2 other 
Neuroblastoma RAS viral (v-ras) oncogene homolog 1 .2 ? 
YY1 transcription factor 1.2 other 
ESTs 1.2 ? 
ESTs 1.7 other 

ESTs Highly similar to probable chloride channel 3 [H.sap 1 .3 



other 
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TABLE 7A 

Table 7 A shows the accession numbers for those pkeys lacking unigenelD's for Table 7. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number 

102481 31281.-28 

105032 genbank_AA127818 

409487 1134778J 



Accession 

U50360 
AA1 27818 

H19886AW402806T10231 
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TABLE 8: Figure 8 from BRCA 001-1 US 

5 Table 8 shows genes upregulated in tumor tissue compared to normal breast tissue. 

Specifically, one column shows the ratio of expression of the indicated gene in breast tumor 
tissue compared to other body tissues, and another column shows the ratio of expression of 
the indicated gene in breast tumor tissue compared to normal breast tissue. 



10 





Pkey: 


Unique Eos probeset identifier number 








ExAccn: 


Exemplar Accession number, Genbank accession number 








UnigeneiD: Unigene number 








Unigene Title: Unigene gene title 








R1: 


Ratio of tumor to normal body tissue 








R2: 




Ratio of tumor to normal breast tissue 






W 

w ! 


Pkey 


ExAccn 


UnigeneiD Unigene Title 


R1 


R2 




100075 


AF1 52333 


Hs.284160 protocadherin gamma subfamily B, 4 


1 


3.8 




100229 


AV652249 


Hs.180107 polymerase (DNA directed), beta 


1.7 


5.3 




100262 


D38500 


Hs.278468 postmeiotic segregation increased 2-like 


0.8 


4.8 




100271 


BE160081 


Hs.256290 S100 calcium-binding protein A11 (calgiz 


3.2 


2.3 




100355 


AI907114 


Hs.71465 squalene epoxidase 


3.3 


1.4 


100522 


X51501 


Hs.99949 prolactin-induced protein 


11.9 


0.4 




100552 


AA019521 


Hs.301946 lysosomal 


3.8 


1.2 




100599 


X77343 


Hs.334334 transcription factor AP-2 alpha (activat 


9.4 


9.4 




100676 


X02761 


Hs.287820 fibronectin 1 


3 


7.8 




100690 


M383256 


Hs.1657 estrogen receptor 1 


4.4 


4.4 




100895 


U01351 


Hs.75772 nuclear receptor subfamily 3, group C, m 


1 


3.9 




101046 


K01160 


NM_002122:Homo sapiens majorhistocompat1.7 


4 




101086 


M382524 


Hs.250959 histatin 1 


0.8 


4.1 


i y 


101148 


NM 002923 


Hs.78944 regulator of G-protein signalling 2, 24k 


1.2 


12 




101161 


NM 006262 


Hs.37044 peripherin 


3.1 


1.1 




101201 


L22524 


Hs.2256 matrix metalloproteinase 7 (MMP7; uterin 


4.4 


0.6 




101212 


AI186220 


Hs.83164 collagen, type XV, alpha 1 


3.1 


3.4 




101441 


AW468397 


Hs.100000 S100 calcium-binding protein A8 (calgran 


0.9 


4.2 




101447 


M21305 


gb:Human alpha satellite and satellite 3 


29.9 


0.3 


40 


101469 


AA310162 


Hs.169248 cytochrome c 


0.8 


4.9 


101567 


M33552 


Hs.56729 lysosomal 


1 


5.9 




101600 


BE561617 


Hs.1 19192 H2Ahistone family, memberZ 


2.8 


4 




101624 


M55998 


gb:Human alpha-1 collagen type 1 gene, 3 


3.1 


1.7 




101674 


NM 002291 


Hs.82124 laminin, beta 1 


1.5 


4.1 


45 


101861 


AA350659 


Hs.83347 angio-associated, migratory cell protein 


3.1 


1.4 


101977 


AF1 12213 


Hs.1 84062 putative Rab5-interacting protein 


1.3 


6.9 




102193 


AL036335 


Hs.313 secreted phosphoprotein 1 (osteopontin, 


1.9 


4.9 




102199 


AA334592 


Hs.79914 lumican 


2.2 


3.8 




102304 


AF015224 


Hs.46452 mammaglobin 1 


4.2 


0.7 


50 


102345 


NM 003480 


Hs.300946 Microfibril-associated glycoprotein-2 


1.1 


4.2 


102457 


NM_001394 


Hs.2359 dual specificity phosphatase 4 


4.5 


0.5 




102534 


U96759 


Hs.198307 von Hippel-Lindau binding protein 1 


1.4 


4.2 




102541 


AI379954 


Hs.79025 KIAA0096 protein 


0.9 


3.9 




102827 


BE244588 


Hs.6456 chaperanin containing TCP1 , subunit 2 (b 


1.5 


10.9 


55 


102962 


R50032 


Hs.159263 collagen, type VI, alpha 2 


2.2 


6.2 


102991 


AW293542 


Hs.75309 eukaryotic translation elongation factor 


5.6 


5.7 




103119 


X63629 


Hs.2877 cadherin 3, type 1 , P-cadherin (placenta 


3.7 


0.5 




103175 


X69089 


Hs.79227 myomesin (M-protein) 2 (1 65kD) 


1.3 


4 




103286 


D38616 


Hs.54941 phosphorylase kinase, alpha 2 (liver) 


1.3 


3.8 


60 


103319 


X83492 


Hs.82359 tumor necrosis factor receptor superfami 


0.8 


4.6 


103372 


BE536700 


Hs.4888 seryl-tRNA synthetase 


0.9 


8 




103419 


T34708 


Hs.272927 Sec23 (S. cerevisiae) homolog A 


1.1 


5.1 




103471 


Y00815 


Hs.75216 protein tyrosine phosphatase, receptor t 


3.7 


1.2 




103546 


Z14244 


Hs.75752 cytochrome c oxidase subunit Vllb 


0.9 


4.4 
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103658 


NM_000088 


Hs.1 72928 


collsgen, typ© I, alpha 1 


3.2 


3 




103758 


AA084874 




gb;zn13604.r1 Stratagens hNT neuron (937 


0.9 


10 




103774 


H24185 


Hs.92918 


hypothetical protein 


19 


15.9 




103821 


AA095971 


Hs 198793 

1 IO. 1 %J\J 1 kJ^J 


Homo sapiens cDNA: FU22463 fis, clone H 


1.2 


3 Q 


5 


103869 


BE439604 


Hs.24322 


ATPase, H+ transporting, lysosomal (vacu 


1.4 


3 9 

O.J 




103980 


AW1 30242 


Hs.293476 


hypothetical protein FKSG44 


1.6 


4.1 




104054 


AK001913 


Hs.7100 


hypothetical protein 


1.5 


4.3 




104115 


AF183810 


Hs 26102 


opposite strand to trichorhinophalangeal 


7 


7 


10 


104189 


AB040927 


Hs 301804 

1 iv.vv i uvn 
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high density lipoprotein binding protein 
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hypothetical protein FLJ14735 
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128453 X02761 Hs.287820 fibronectin 1 1.2 

1 28460 T16206 Hs.237164 ESTs, Highly similar to LDHHJHUMAN L-LAC 3.1 

128491 H08379 Hs.165563 hypothetical protein DKFZp434N1429 0.6 

128495 NM_005904 Hs.1 00602 MAD (mothers against decapentaplegic, Dr 1.3 

128546 NM_003478 Hs.101299 cullin 5 1 

128574 AI185977 Hs.38260 ubiquitin specific protease 18 0.8 

128611 NM_014721 Hs.102471 KIAA0680 gene product 1.3 

128652 AA432202 Hs.103147 hypothetical protein FU21 347 1.4 

128653 D87432 Hs.10315 solute earner family 7 (cationic amino 1.2 
128655 AI246669 Hs.324275 WW domain-containing protein 1 0.8 
128684 BE246444 Hs.283685 hypothetical protein FU20396 3 
128717 AK001564 Hs.1 04222 hypothetical protein FU1 0702 2.8 
128774 AA476220 Hs.54457 CD81 antigen (target of antiproliferativ 1.1 
128790 AF026692 Hs.105700 secreted frizzied-related protein 4 1 
128805 AA1 94554 Hs.1 83434 ATPase, H+ transporting, lysosomal (vacu 5.3 
128827 AI638184 Hs.106334 Homo sapiens clone 23836 mRNA sequence 2.2 
128840 AI917602 Hs.106440 ESTs 1 
128869 AA768242 Hs.80618 hypothetical protein 0.8 

128889 D60985 Hs.106909 DKFZP566D1 93 protein 4.6 

128890 AI222020 Hs.182364 CocoaCrisp 3 
128915 AK000140 Hs.107139 hypothetical protein 0.2 
128920 AA622037 Hs.166468 programmed cell death 5 2.5 
128926 AF155096 Hs.107213 hypothetical protein FLJ20585 4 
128930 AA298958 Hs.10724 MDS023 protein 1.2 
128942 AW247536 Hs.10729 hypothetical protein 1.4 
128948 AW953622 Hs.223025 RAB31, member RAS oncogene family 2.3 
128953 AB020716 Hs.107362 KIAA0909 protein 0.9 

128979 AW271217 Hs.281434 Homo sapiens cDNA FLJ14028 fis, clone HE 1.5 

128980 AA258924 Hs.10758 NM_002495*:Homo sapiens NADH dehydrogena 
129005 AI770025 Hs.13323 hypothetical protein FLJ22059 1.2 
129009 C15105 Hs.330716 Homo sapiens CDNAFLJ14368 fis, clone HE 2.1 
129013 AA371156 Hs.107942 DKFZP564M1 12 protein 2.4 
129068 AI634522 Hs.152925 KIAA1268 protein 1.2 
129106 AW504486 Hs.108689 sterol regulatory element binding transc 1.2 
129113 BE543205 Hs.288771 DKFZP586A0522 protein 0.5 

129125 AB002450 Hs.278391 CGI-109 protein 1 

129126 AW881089 Hs.108806 Homo sapiens mRNA; cDNA DKFZp566M0947 (f 
129151 N23018 Hs.171391 C-terminal binding protein 2 2.1 
129230 AA335362 Hs.109646 Empirically selected from AFFX single pr 0.9 
129234 M1891 6 Hs.282997 glucosidase, beta; acid (includes glucos 1.1 

129238 BE542214 Hs.109697 ESTs 1.1 

129239 W57656 Hs.109701 ubiquitin-like 5 3.2 
129241 AI878857 Hs.109706 hematological and neurological expressed 1.9 
129243 BE169531 Hs.109727 TAK1-binding protein 2; KIAA0733 protein 1.2 
129247 R49920 Hs.109733 CGI-131 protein 1.5 
129250 AA344367 Hs.109760 Empirically selected from multiple AFFX 1 
129258 AA250970 Hs.251946 poly(A)-binding protein, cytoplasmic 1-1 1.3 
129260 AF077200 Hs.279813 hypothetical protein 1.6 
129270 AA357185 Hs.109918 ras homolog gene family, member H 1.8 
129277 AB007896 Hs.110 putative L-type neutral amino acid trans 1.1 
129284 AA318224 Hs.296141 ESTs 2.5 
129300 W94197 Hs.110165 ribosomal protein L26 homolog 1.6 
129318 AF189062 Hs.285976 tumor metastasis-suppressor 1.8 
129352 AW51 1656 Hs.170177 Meisl (mouse) homolog 0.9 
129362 U30246 Hs.110736 solute carrier family 12 (sodium/potassi 1.4 
129379 BE278964 Hs.11085 CGI-1 11 protein 1 
129390 AA318271 Hs.250905 hypothetical protein 1 
129416 AA016188 Hs.111244 hypothetical protein 1.8 
129427 AI498631 Hs.111334 ferritin, light polypeptide 1.1 
129470 W92931 Hs.250899 heat shock factor binding protein 1 1.8 
129472 AL050260 Hs.323817 DKFZP547E1010 protein 1 
129475 NM_004477 Hs.203772 FSHD region gene 1 1.1 
129498 AA449789 Hs.75511 connective tissue growth factor 1.9 
129501 AI631811 Hs.180403 STRIN protein 1.1 
129527 AA769221 Hs.270847 delta-tubulin 1.1 
129545 R18087 Hs.323769 cisplatin resistance related protein CRR 1 
129579 AW517695 Hs.286218 junctional adhesion molecule 1 2.3 
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129606 AW968941 Hs.166254 hypothetical protein DKFZp566l 133 2.4 4.4 

129619 AA209534 Hs.284243 tetraspan NET-6 protein 3.2 13 

129620 D79338 Hs.239720 CCR4-NOT transcription complex, subunit 1.6 4.6 

129621 AL110212 Hs.301005 purine-rich element binding protein B 1.1 5.7 
5 129634 AB020335 Hs.181300 sei-1 (suppressor of lin-12, C.elegans)- 0.9 4.3 

129663 AI207406 Hs.11866 translocase of inner mitochondrial membr 1.9 4.8 

129679 AW889132 Hs.11916 ribokinase 0.9 4.1 

129688 U53209 Hs.24937 transformer-2 alpha (htra-2 alpha) 1.3 4.7 

129691 M26939 Hs.1 19571 collagen, type 111, alpha 1 (Ehlers-Danl 4.7 3.7 

10 129712 U46386 Hs.12102 sorting nexin 3 1.2 3.6 

129747 AL050272 Hs.12305 DKFZP566B183 protein 1 8.9 

129788 BE397454 Hs.124969 Homo sapiens clone 24707 mRNA sequence 1.4 3.6 

129796 BE218319 Hs.5807 GTPaseRabH 2.9 5.1 

129797 M62839 Hs.1252 apolipoprotein H (beta-2-glycoprotein I) 0.3 5.1 
15 129800 AF052112 Hs.12540 lysosomal 1.6 8.8 

129834 AL080084 Hs.296155 CGI-100 protein 0.9 5.3 

129836 AW410233 Hs.206521 YME1 (S.cerevisiae)-like 1 1.8 9.9 

y. 129843 NM_014840 Hs.200598 KIAA0537 gene product 0.9 3.6 

U. 129874 AA626937 Hs.181551 hypothetical protein MGC2594 1.4 9.5 

==20 129878 Z43161 Hs.283714 30 kDa protein 1.1 6.3 

0 129904 AL1 19499 Hs.13285 neuronal potassium channel alpha subunit 1 3.5 

in 129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 2 5.1 

* ; 129976 X14008 Hs.234734 lysosomal 0.9 4.9 

fe 129982 Z14221 gb:H.sapiens germline transcript of Ig h 1.2 3.6 

f$5 130007 R15917 Hs.142570 Homo sapiens clone 24629 mRNA sequence 4.3 1.3 

v"i 130060 BE277024 Hs.146381 RNA binding motif protein, X chromosome 1.6 3.8 

j 1 130064 X57815.comp Empirically selected from AFFX single pr 1.2 8.2 

Q 130068 M93143 Hs.262869 plasminogen-like 1.4 7.9 

130090 H97878 Hs.132390 zincfingerprotein36(KOX18) 1.4 12.3 

130 130095 AK001635 Hs.14838 hypothetical protein FLJ10773 0.2 4.6 

W 130102 W61005 Hs.14896 DHHC1 protein 1 4.1 

M ! 130112 AA916785 Hs.180610 splicing factor proline/glutamine rich ( 1.2 5.3 

hi 130115 T47294 Hs.1 49923 X-box binding protein 1 3.8 0.8 

130123 NM_005095 Hs.150390 zinc finger protein 262 1 4.2 

<35 130150 BE094848 Hs.15113 homogentisate 1 ,2-dioxygenase (homogenti 0.5 4 

p 130161 R42678 Hs.151385 KIM0564 protein 1 3.7 

jTi 130210 M23115 Hs.1526 ATPase, Ca++ transporting, cardiac muscl 0.4 4.4 

« r *' ; 130213 BE278370 Hs.15265 heterogeneous nuclear ribonucleoprotein 1.7 7.5 

130215 BE301883 Hs.152707 glioblastoma amplified sequence 1 5.6 

40 130232 U29463 gb:Human cytochrome b561 gen 1.2 4.2 

130252 U92014 Hs.153527 Homo sapiens pTM5 mariner-like transposo 1.3 3.6 

130281 W78907 Hs.15395 similar to arginyt-tRNA synthetase (argi 1.5 4.4 

130343 AB040914 Hs.278628 KIAA1481 protein 2.9 7.5 

130385 AW067800 Hs.155223 stanniocalcin 2 3.2 0.2 

45 130414 AW842182 Hs.241392 small inducible cytokine A5 (RANTES) 1.4 10.6 

130417 AW163518 Hs.155485 huntingtin interacting protein 2 1.7 11.7 

130440 AA852868 Hs. 1 32853 KIAA0171 gene product 1.1 5 

130442 NM_006245 Hs.1 18244 protein phosphatase 2, regulatory subuni 1.4 4.3 

130465 AW362955 Hs.15641 Homo sapiens cDNA FU14415 fis, clone HE 1.6 7.6 

50 130479 R44163 Hs.12457 hypothetical protein FLJ10814 0.9 4.1 

130499 AB007915 Hs.158286 KIAA0446 gene product 1 3.8 

130546 AI598022 Hs.193989 TAR DNA binding protein 1.3 4.7 

130568 AA232119 Hs.16085 putative G-protein coupled receptor 1.2 9.4 

130606 AI652143 Hs.288382 hypothetical protein FLJ131 11 1 4.1 

55 130612 BE242873 Hs.16677 WD repeat domain 15 1.1 3.6 

130616 AL049963 Hs.284205 up-regulated by BCG-CWS 0.6 3.8 

130623 AL045128 Hs.1691 glucan(1,4-alpha-), branching enzyme 1 0.9 6.6 

130629 AL042896 Hs.1 697 ATPase, H+ transporting, lysosomal (vacu 0.9 3.9 

130632 AW073971 Hs.238954 ESTs, Weakly similar to KIM1204 protein 0.9 6.9 

60 130639 AI557212 Hs.1 71 32 ESTs, Moderately similar to I54374 gene 2.6 3.9 

130641 AF158555 Hs.239189 glutaminase 1.2 13.8 

130653 AI861791 Hs.278479 TSPY-like 1.3 4 

130655 AI831962 Hs.17409 cysteine-rich protein 1 (intestinal) 2.5 4 

130666 AL117508 Hs.194035 KIAA0737 gene product 1.3 6.2 

65 130669 AI928985 Hs.17680 hypothetical protein MGC1314 similar to 1.4 3.9 

130693 R68537 Hs.17962 ESTs 3.2 0.8 

130694 NM_014827 Hs.17969 KIAA0663 gene product 1.1 4.8 
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Hs.8128 phosphatidylserine decarboxylase 


1.1 


5.9 




mm nnii^n 


8116 pnrlnfhplial PAS domain nrotein 1 


0.5 


4.8 


114310 


ai n?7finn 


H9 8148 splpnnnmtein T 


1.7 


7.9 




UJUUOO 


Hs. 82028 transforming growth factor, beta recepto 


0.8 


7.6 


134364 




Hs. 82226 glycoprotein (transmembrane) nmb 


2.2 


3.6 


134374 


N22687 


Hs 8236 ESTs 


1.9 


3.6 


114178 


AL03578R 


Hs.82425 actin related protein 2/3 complex, subun 


1.5 


8.3 


1141R? 

IO*rOO£ 


DCC'IOQCC 
DC J l£OJO 


Hq IflPflU SH? domain hindinn Glutamic acid-rich Dr 

1 iO- I ijJwvJ I o i i%j uvi t lull t uu iuii iy y luicu i \ wviu i ivi i y*i 


1.1 


3.6 


134415 


A1750762 


Hs.8291 1 protein tyrosine phosphatase type IVA, m 


1.9 


4.6 


114417 


MM 006416 

l"IIVI__UVO*T IU 


H<! 89921 snlutp carrier familv 35 (CMP-sialic aci 


1.2 


7.5 


114491 


AU077196 

AUu III JU 


Hs 82985 collaaen tvrje V atoha 2 


6.6 


8.7 


11441Q 




Hq nftflfiO Rhn GTPa«.p activatina orotein 1 


2 


3.9 


134454 


NM 013230 


Hs.286124 CD24 antigen (small cell lung carcinoma 


3.5 


1.1 


134494 


D86981 


Hs.84084 amyloid beta precursor protein (cytoplas 


1.5 


4.4 


134501 


W84869 


Hq 91 l^fift pnkarvntif* tranQlatinn initiation factor 


1.2 


5.7 


134505 


AW960673 


Hs. 177530 ATP synthase, H+ transporting, mitochond 


1.3 


3.9 


134520 


BE091005 


Hs. 74861 activated RNA polymerase II transcriptio 


1.8 


4.3 


134528 


M23161 


Hs.84775 Human transposon-like element mRNA 


0.8 


5.6 


134545 


AI902899 


Hs.85155 butyrate response factor 1 (EGF-response 


1.4 


5 


134553 


AI203545 


Hq 9Qfi1fiQ K-nhaQp rpQnonop (rvrlin-related^ 


0.8 


3.9 


134573 


NM 016142 


Hq 97Q.R17 Qfprnid dphvrirnnpnaQp hnmoloa 
note/ I / oiciuiu uwijumycnaoc nuiiiuiwy 


1.3 


5.7 


134576 


AB033017 


Hq KIAA1191 nrotein 


0.9 


3.7 


134577 


BE244323 


We A£Q^ ovnnrtin tRNIA /niirlpar pynnrt rprpntnr 


4 


6.8 


134579 


AW936928 


Hq ftSQfi^ DKF7P5G4M182 nrotein 


2.2 


4.3 


134582 


AA927177 


Hq ftfiflAI Pftft trinlpf rpnpaf hindinn nmtpin 1 


1.6 


3.6 


134600 


AF078859 


He ftfi^A7 hvnnthpfiral nrnfpin 


2.1 


3.5 


134655 AF265208 


Hq 19^nQn QWI/QWF rpfafpd matriY aQQnriatpd arti 


1.7 


4.2 


134700 


AK000606 


Hq Rftfift nnlni 9NAP rpnpntnr mmnlpx mpmhpr 1 


4.4 


0.9 


134737 


D17530 


Hq RQd*W drphrin 1 


3.1 


1.6 


134762 T51986 


Hq 9ft*3inft hpmfvilnhin namma R 


0.5 


4.6 


134843 


AA428520 


Hq Qflflfil nrnnPQfpmnp hindinn nmtpin 


1.3 


3.7 


134854 


J03464 


He rvillanen h/np E alnha 9 


8.7 


17.3 


134865 


AA587775 


Uc RROQ^ miilfl PrV7_r1nmain_r'rintaininn nm+pin 
nb.QO^?3 IIIUIU r L/&.-UUM IcIIH-WJI llall III Iy UJUlGll 1 


1.7 


4 


134868 


AB020689 


He QfW1Q k'lAADPft? nrnfpin 


3.4 


0.9 


134874 


AI803761 


He QfW\ft carina nalmi+Avy[+rancfpracA lonn /*hain 
nS.yu*+v>0 ocllllc pdlllllLUyiUctllolCIaoC, lUlly Ollalll 


1.3 


6.9 


134885 


AJ002030 


He Q071 nrnfiocfomno momhranp hindinn nrnfpin 
rio.\7U/ 1 [jiuycoLciuiic iiiciiiuiaiic uinuiuy uiuiciii 


1.4 


9.6 


134891 


R51083 


Hq Q07R7 F^Tq 


\ 


10.1 


134908 


BE089782 


He dH.77 hunnthpfiral nrnfpin 
MowOf / IIJf(JUUIcUwal (JIULCIII 


1.9 


3.9 


134934 


AF005043 


We QlVin nnlv iAnP.rihnQp\ nlumhvdrnlaop 


1 


4.3 


134970 


BE560779 


Hq 9R42^^ NIHF-S nrotein 


1.4 


10.4 


134982 


AK002085 


He Q9**nft Hnmn eanipnc rHWA Fl t1199^fiQ rlnnp PI 


1.6 


4.1 


135011 


AB037835 


Hs.92991 KIAA1414 protein 


12 


5.6 


135032 


AW301984 


Hs.173685 hypothetical protein FLJ12619 


1.7 


7.6 


135035 


AL034344 


Hs.284186 forkhead box C1 


3.2 


0.6 


135051 


AI272141 


Hs.83484 SRY (sex determining region Y)-box 4 


4.2 


4.1 


135060 


AK001887 


Hs.259842 protein kinase, AMP-activated, gamma 2 n 


1.3 


4.8 


135062 AK000967 


Hs.93872 KIAA1682 protein 


2 


3.7 


135077 


AW503733 


Hs.9414 KIAA1488 protein 


2.8 


3.7 


135082 


AB017363 


Hs.94234 frizzled (Drosophiia) homolog 1 


2.4 


4.8 


135107 


T97257 


Hs.337531 ESTs, Moderately similar to I38022 hypot 


1.4 


5.8 


135143 


AA132813 


Hs.69559 KIAA1096 protein 


1.8 


8.5 


135156 


BE563088 


Hs.9552 binder of Art Two 


1.2 


6.8 


135172 


AB028956 


Hs.12144 KIAA1033 protein 


3.1 


1.4 


135181 


BE250865 


Hs.279529 px19-like protein 


1.3 


7.5 


135222 


AA534009 


Hs.183487 interferon stimulated gene (20kD) 


1.3 


3.8 


135232 


AL038812 


Hs.96800 ESTs, Moderately similar to ALU7_HUMAN A 2.1 


3.9 


135289 


AW372569 


Hs.9788 hypothetical protein MGC10924 similar to 


0.9 


8.4 


135290 


AA331901 


Hs.184736 hypothetical protein FLJ10097 


1 


3.8 


135291 


T83882 


Hs.97927 ESTs 


1.2 


3.5 


135349 


AA1 14212 


Hs.9930 serine (or cysteine) proteinase inhibito 


2.6 


8.9 


135357 


AI565004 


Hs.79572 cathepsin D (lysosomal aspartyl protease 


2.5 


5.4 


135398 


M16029 


Hs.287270 ret proto-oncogene (multiple endocrine n 


0.4 


7.9 


135399 


W79431 


Hs.326249 ribosomal protein L22 


1.5 


4.5 


135400 


X78592 


Hs.99915 androgen receptor (dihydrotestosterone r 


3.2 


1.8 
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302665 


R99693 


Hs.224410 Homo sapiens cDNA FU 12843 fis, clone NT 3.6 


3.6 


302892 


AW1 76909 


Hs.42346 


calcineurin-binding protein calsarcin-1 


3.3 


1.6 


302963 


AW673106 


Hs.151945 mitochondrial ribosomal protein L43 


0.9 


4.2 


303131 


AW081061 


Hs.103180 DC2 protein 


3 


17.3 


303150 


AA887146 


Hs.8217 


stromal antigen 2 


6.2 


4 


310125 


M147979 


Hs.285005 mitochondrial import receptor Tom22 


1.2 


6.6 


312662 


AA233808 


Hs.286241 protein kinase, cAMP-dependent, regulato 


1 


3.5 


319429 


BE616412 


Hs.286218 junctional adhesion molecule 1 


1.5 


4.7 


320591 


AA054761 


Hs.169149 karyopherin alpha 1 (importin alpha 5) 


1.2 


5.6 


406779 


AA412048 


Hs.279574 CGI-39 protein; cell death-regulatory pr 


1.3 


3.5 


410691 


AW239226 


Hs.65450 


reticulon 4 


1.2 


13.9 


410763 


AF279145 


Hs.8966 


hypothetical protein FLJ21776 


2 


5.1 


415738 


BE539367 


Hs.295953 ESTs, Weakly similar to AF2200491 uncha 


1.3 


3.9 


420186 NM_015925 


Hs.95697 


liver-specific bHLH-Zip transcription fa 


1.5 


6.2 


422055 


NM 014320 


Hs.1 11029 putative heme-binding protein 


2 


11.3 


425815 


R94023 


Hs.337531 


ESTs, Moderately similar to I38022 hypot 


1.7 


3.6 


426218 


AF1 19043 


Hs.168005 Homo sapiens cDNA FU13372 fis, clone PL 


3.3 


2.8 


427397 


AI929685 


Hs.177656 calmodulin 1 (phosphorylase kinase, delt 


1.3 


4.7 


427466 


AA523543 


Hs.7678 


cellular rettnoic acid-binding protein 1 


1.1 


3.7 


427505 


AA361562 


Hs.178761 26S proteasome-associated pad1 homolog 


3.2 


2.5 


427723 


AI355260 


Hs.279789 histone deacetylase 3 


2.8 


22 


428673 


AW601325 


Hs.324278 Homo sapiens mRNA; cDNA DKFZp566M063 (fr 


1.1 


430219 


X99209 


Hs.235887 HMT1 (hnRNP methyltransferase, S. cerevi 


1.8 


8.8 


430450 


R23553 


Hs.241489 hypothetical protein 


1.1 




432866 


BE395875 


Hs.279609 mitochondrial carrier homolog 2 


1.5 


6.1 


433423 


BE407127 


Hs.8997 


heat shock 70kD protein 1A 


1.3 


7 R 


437562 


AB001636 


Hs.5683 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 


1.6 


6.5 


437667 


BE616412 


Hs.286218 junctional adhesion molecule 1 


1.3 


0.0 


437754 


R60366 


Hs.5822 


Homo sapiens cDNA: FLJ22120fis, clone H 


2 


<5 7 

J./ 


440252 


BE513940 


Hs.6101 


hypothetical protein MGC3178 


1.1 


ft O 


441471 


AL042986 


Hs.7857 


erythrocyte membrane protein band 


0.5 


"? 7 


448292 


BE281316 


Hs.47334 


hypothetical protein FLJ14495 


2.5 


4.9 


449404 


H51066 


Hs.23581 


leptin receptor gene-related protein 


1.1 


3.6 


449964 AW001741 


Hs.273193 hypothetical protein FU10706 


1.4 


0.0 


451389 


N73222 


Hs.279009 matrix Gla protein 


4 


11 9 


452685 


AI634651 


Hs.30250 


v-maf muscubaponeurotic fibrosarcoma (a 


0.8 






RC_H15847 s 




peptidylprolyl isomerase B (cyciophilin B) 


1.8 


4ft 




RC W84712 




calumenin 


3.5 


*r.O 




X14008_rna1J 


lysozyme (renal amyloidosis) 


0.9 






RC H86543 f 




ESTs 


1.8 


ft fi 




H07011 




ESTs; Weakly similar to SAS [H.sapiens] 


1.8 


O.v? 




RC_AA164586_s 




ESTs 


ft 0 
D.Z 




RC AA070485 




Homo sapiens clone 23967 


3.4 






RC_H98714_s 




ESTs 


1.6 


0.0 




RC_M406145_f 




ESTs 






AA458584 




SRY (sex determining region Y)-box 4 


3.4 


n a 




AA031548 




cell division cycle 42 (GTP-binding protein; 25kD) 


^ 1 

0. I 




X02761 




fibronectin 1 


3.6 


1 U.£ 




RC_AA487193 


secreted frizzled-related protein 4 


4.7 


A 

*T 




R25326 




Homo sapiens mRNA for putative vacuolar 


0.9 


c 
0 




RC AA393805 


ESTs; Weakly similar to (defline not 


1.1 


8.4 




RC_AA449333 


ESTs 


2.9 


4.6 




RC_AA287681 


_s 




ESTs 


1.3 




RCJ\A490864 


ESTs; Highly similar to heat shock factor 


1.4 


5 




RC C14243 f 




ESTs; Highly similar to heat shock factor 


1.7 


5 




R21443 




ESTs 


1.6 


3.7 




RC_M251902 


Homo sapiens lysophospholipase (LPL1) 


2.2 


3.8 




M21121 s 




small inducible cytokine A5 (RANTES) 


0.9 


9.9 




C00038 s 




ESTs 


2.8 


4.8 




Y00503 




keratin 19 


3.1 


1.1 




RC_R27006 f 




ESTs 


1.6 


3.7 




RC M416886 


ESTs; Weakly similar to predicted using 


3.1 


3.1 




RC_M460450 


fibroblast growth factor receptor 2 (bacteria- 


1.5 


3.7 




RC M488433 


ESTs; Weakly similar to deduced amino acid 


1.1 


4 




RC AA278400 f 




Homo sapiens HR 




U28831 




Human protein immune-reactive with anti-PTH4.4 


0.6 




RC_AA1 99588 


Homo sapiens actin-related protein Arp3 (ARP3) 


1.8 



5.2 



0.8 



3.9 



1.5 



3.6 



4.7 
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AF006082 Homo sapiens actin-related protein Arp2 (ARP2) 1.6 10.9 

RC_H90899 desmoplakin (DPI; DPII) 5.4 5.5 

RC_W95070 desmoplakin (DPI; DPII) 5 2.6 

RC_T90946_f Human mRNA for KIAA263 gene; complete cds 1.1 3.9 

D87258 protease; serine; 11 (IGF binding) 2.4 3.5 

AA313414_s ESTs; Weakly similar to cDNA EST EMBLT1 157 1.5 5.3 

RC_H73484_s ESTs; Weakly similar to similar to Yeast 1.3 6.3 

AFFX-HUMISGF3A/M97935 3 2.3 13.5 

AFFX-HUMRGE/M10098_5 1.1 7.9 

AFFX-M27830_5 0.5 7.4 

AFFX-M27830_5 0.6 5.4 

RC_M063431_f ESTs 0.8 4.1 

RC_T63769_f ferritin; light polypeptide 1.1 3.7 
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TABLE 8A 



Table 8 A shows the accession numbers for those pkeys lacking unigenelD's for Table 8. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

108469 116761J AA079487AA128547 AA128291 M079587 AA079600 
125076 190299.1 AA973971 T88817 AA253263 

114636 109698J AA075488 M129081 AA074851 M082852 M074732 AA084908 M084751 M076042M131172 AA085374 AA079519 
AA074510 AA113824 AA102437 AA070833 AA070143 AA084693 AA084389 AA076373 AA075492 M062834 AA084335 
AA078829 AA079344 AA069916 AA079275 AA070914 AA654069 AA081976 AA080957 M083115 M070942 AA085296 

123526 genbank_M608657 M608657 

123533 genbank_AA608751 AA608751 

125090 genbank_T91518 T91518 

125154 genbank_W38419 W38419 

118475 genbank_N66845 N66845 

118505 genbank_N67343 N67343 

101046 entrez_K01160K01160 

129982 221_267 Z14221 AW381862 M97920 AW401444Z66542 M29470 AW406502X61011 M34024AA327072Z14166Z14167Z14165 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
AF062177 AF062222 AF062115 Z47240 AF062263 AF062261 AF062223 AF062211 Z47238 AW401714 AW404008 AW404991 
AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z1 1946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 X81748 AF174019 AF062285 L06924 L1 1699 AW402665 L09070 L28049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 AI268604 Y15773 X64239 X62969 U00506 
X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 
D83676 AB021529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF115117 Z92896 U21254 Z92895 AF115112 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792AJ006171 U00510 U58144AF004323AF115109 AF004324 AF015130AF090414AF090418 AB021536 AB021519 
Z96957 AF021986 Z92898 Z96956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 
AF022011 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955Z96954 Z33903 X62964AF1 03282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 AF103184 L04343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF1031 15 
AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF068670 AF068671 AF068664 
AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NM_002179 AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW406586 AA482084 AI872299 AA715266 AW404328 AI831674 AI709348 AA6031 12 
AW514864 AA485775 H64492 AW404789 M487630 AA715498 M295885 T27613 T98113 

108470 genbank_AA079500 AA079500 
101447 entrez_M21305 M21305 
124447 genbank_N48000 N48000 
101624 entrez_M55998 M55998 

131791 221_260 X62111 S67984 AJ131056Z47243 Z47235 AF062268 Z47237 AJ131058 AF062130 AF062282 X62108 AA385989 AA464794 
X69861 AW402964 M90808 Z98735 Z98734 Z98736 AF035018 X79161 U00545 AF174046 AF174071 U00552 U96288 
AF021989 AF062255 AF174061 S73953 AF062135 AF062155 X64147 U00555 AF174070 AF129754 U86789 Z98714 Z98738 
S75886 AJ008175 AJ008186 AJ008188 Y08403 H15753 H22208 AA327241 AW405737 H42300 H39677 H26239 H26903 
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H45128 R86072 AA327565 AI660584 AW361537 AW383759 S71043 H15014 H45570 H42819 H45523 H45134 R72043 
H24543 H27636 H27610 T28147 H25496 AW364071 AW364072 H45561 H45556 H42605 AI735017 T47421 R48719 H27570 
H44599 AI459598 H42347 H41938 H24993 AA345888 H22339 AI538691 AJ01 2264 AA664201 AI880450 AA327310 AI991250 
AI833028 AW001210 A1956075 H30467 AA326915 H41943 AI749266 AI744441 AA327377 AW512326 AI735170 H01634 
AI587047 AI571623 AA327486 M327103 AA327195 AA326973 T28143 

124842 217726J R56485 R37248 R59992 

103758 AA084874_f_at AA084874J 

130064 221_264 X57815 L29157 AA367448 AW328098 AW404536 L29156 L29163 H27683 R83195 AA295096 AW327822 H25458 AW404692 
X57819 X57823 AW405604 AW404447 Z34914 AW406542 M427726 AA604389 AW405606 AW40591 8 AW4051 17 
AW407182 L03632 AW405058 L03627 AW407470 R72738 L21959 AW375738 X87888 AF124166 AF054638 AJ241406 
U09902 AW405284 M12025 AA360219 L03631 L03629 AW327727 AF194810 AF058076 AF194686 AF063774 AF063737 
AF063755 AF063740 AF063739 AF063738 S73129 L43088 AF063775 L33026 AF060138 AF194813 AF194812 AF194811 
AF194809 AF194814 AF194808 AF064505 AF064503 AF063765 AF063757 AF063752 AF063716 AF063736 AF194806 
AF058077 AF063747 AF063772 AF063781 AF060137 AF194805 AF060134 AF060132 AF058074 AF063754 AF063704 
U38589 Z18332 AF060122 AF194807 AF060135 AF064506 AF064504 AF063773 L26540 AF058072 AF060131 AF064500 
AF0601 36 AF064513 AF194683 AF194711 AF058075 AF063717 Z19546 AF194581 X72746 U96393 U09890 X98897 
AF194592 M80916AJ241405AF194632AJ388659AF194625 Z74694AF194588AF194601 U09901 U09911 U09892 
AJ010336 AJ006162 AJ249377 AJ241414 AA327392 U97248 X72747 Z46850 X95739 AW406701 AJ243109 AF194609 
S80758 AF194595 AF194596 L22483 Z70262 D84141 AW405758 AF001788 AF194580 D84143 U76684 AF194593 L03630 
X87892 X91 134 U21249 Z46346 AJ132426 AF103659 AJ23371 8 AF021038 AJ233727 H24657 U09882 S75627 AA573599 
AF047231 AF047232 AF103616 AW404484 Z46848 S76132 AF103663 AF103713 Y17940 AF047216 AF103595 Y17956 
AF052799 AF052797 AF052802 AF052798 AF052801 AF052794 AF052796 AF047218 AF052800 AF047217 AF052795 
AF032351 AF103701 AF103708 AF103710 AF103706 S62232 AF103645 AF103632 AF103647 AF103644 AF103640 
AF103692 AF093581 AF103620 AW405934 AI445389 AW383753 AA360256 AF099676 H21654 H39501 AI820828 H53689 
W26785 AW384496 AW407708 AA541663 M91 1602 AI821461 AA588300 AA327050 H42717 AI951280 AA421322 AI923193 
AA864302 H25133 D87023 J03011 M61771 D87017 AA526865 AA253450 

130232 18831.2 U29463 U06715 W24970 AA584303 T39581 M155603AA305043 AA429426 W05664AA1 02382 AA482044 W24487AA319060 

T88946 F10106 AA232161 AA2431 17 AA1 58937 AA1 00864 

109097 genbank_AA167512 M167512 
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TABLE 9: Figure 9 from BRCA 001-2 US 

Table 9 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: U nique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unfcjene number 

Unfcjene Title: Unigene gene title 



Pkey ExAccn UnigenelD UnigeneTitle 

100690 AA383256 Hs.1657 estrogen receptor 1 

102211 BE314524 Hs.78776 putative transmembrane protein 

103587 BE270266 Hs.82128 5T4 oncofetal trophoblast glycoprotein 

104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal syndrome I 

105038 AW503733 Hs.9414 KIM1488 protein 

105500 AW602166 Hs.222399 CEGP1 protein 

105990 AI690586 Hs.29403 hypothetical protein FLJ22060 

106155 AA425414 Hs.33287 nuclear factor l/B 

106373 AW503807 Hs.21907 histone acetyltransferase 

106414 BE568205 Hs.28827 mitogen-activated protein kinase kinase kinase 2 

110009 BE075297 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal 

111900 AF131784 Hs.25318 Homo sapiens clone 25194 mRNA sequence 

114540 AI904232 Hs.75323 prohibitin 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 

117280 M18217 Hs.172129 Homo sapiens cDNA: FU21409 fis, clone COL03924 

119771 AI905687 Hs.2533 EST 

121723 AA243499 Hs.104800 hypothetical protein FU1 01 34 

124059 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypothetical protein YGL050w 

131148 AW953575 Hs.303125 p53-induced protein PIGPC1 

132371 AA235448 Hs.46677 PRO2000 protein 

134169 AI690916 Hs.178137 transducer of ERBB2, 1 

302235 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F1 12 

452410 AL133619 Hs.29383 Homo sapiens mRNA; cDNA DKFZp434 
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TABLE 10: Figure 10 from BRCA 001-3 PCT 



Table 10 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnkjenelD: Unfcjene number 

Unkjene Title: Unkjene gene title 

R1 : Ratio of tumor to normal body tissue 

R2: Ratio of 90 1 " percentile tumor to body 

R3: Ratio of 75 1 " percentile body to tumor 

R4: Ratio of tumor to normal breast tissue 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


R2 


R3 


R4 


100082 


AA1 30080 


Hs.4295 


proteasome (prosome, macropain) 26S subu 


4.2 


152 


36 


12.2 


100103 


M380887 


Hs.5085 


dolichyl-phosphate mannosyltransferase p 


9.8 


123 


13 


5 


100131 


D12485 


Hs.11951 


ectonucleotidepyrophosphatase/phosphodi 


13.2 


244 


19 


9.9 


100147 


D13666 


Hs.136348 


osteoblast specific factor 2 (fasciclin 


15.7 


1030 


66 


5 


100154 


H60720 


Hs.81892 


KIAA0101 gene product 


4.1 


320 


78 


10.6 


100157 


D14661 


Hs.119 


Wilms' tumour 1 -associating protein 


4.7 


119 


26 


3 


100169 


AL037228 


Hs.82043 


D123 gene product 


5.1 


106 


21 


9.2 


100203 


BE242284 


Hs.172199 


adenylate cyclase 7 


4.7 


47 


1 


4.3 


100210 


D26361 


Hs.3104 


KIAA0042 gene product 


4.7 


47 


4 


0.7 


100219 


AW972300 Hs.118110 


bone marrow stromal cell antigen 2 


3.8 


350 


93 


1.9 


100234 


D29677 


Hs.3085 


KIAA0054 gene product; Helicase 


4.1 


64 


16 


3 


100248 


NM 015156HS.78398 


KIAA0071 protein 


3.4 


77 


23 


5.9 


100252 


NM_006207Hs.170040 


platelet-derived growth factor receptor- 


4.5 


45 


4 


4 


100260 


D38491 


Hs.322478 


KIAA01 17 protein 


5.9 


59 


1 


2.6 


100279 


D42084 


Hs.82007 


KIM0094 protein 


3.5 


96 


28 


1.3 


100286 


BE247550 


Hs.86859 


growth factor receptor-bound protein 7 


3.1 


306 


98 


1.5 


100294 


AA331881 


Hs.75454 


peroxiredoxin 3 


12.8 


128 


1 


11.7 


100335 


AW247529 Hs.6793 


platelet-activating factor acetylhydrola 


4.2 


187 


44 


5.4 


100365 


AI878927 


Hs.79284 


mesoderm specific transcript (mouse) horn 


4.5 


129 


29 


3.1 


100375 


D80004 


Hs.75909 


KIAA0182 protein 


3.5 


78 


23 


4.8 


100409 


D86957 


Hs.80712 


KIM0202 protein 


10.2 


102 


1 


4.8 


100410 


D86961 


Hs.79299 


lipoma HMGIC fusion partner-like 2 


4 


40 


1 


3.8 


100414 


NM_014735Hs.82292 


KIM0215gene product 


3.2 


32 


2 


2.9 


100418 


D86978 


Hs.84790 


KIAA0225 protein 


3.6 


36 


7 


3.2 


100438 


AA013051 


Hs.91417 


topoisomerase (DNA) II binding protein 


5.6 


76 


14 


2 


100439 


AA347720 


Hs.1 22669 


KIM0264 protein 


3.5 


35 


9 


3.1 


100448 


AF234887 


Hs.57652 


cadherin, EGF LAG seven-pass G-type rece 


5.5 


145 


27 


2.2 


100449 


D87470 


Hs.75400 


KIAA0280 protein 


3.4 


34 


1 


1.2 


100522 


X51501 


Hs.99949 


prolactin-induced protein 


22.7 


760 


34 


1.4 


100552 


AA019521 


Hs.301946 


lysosomal 


14.4 


144 


9 


4.7 


100643 


NM_005032Hs.4114 


plastin 3 (T isoform) 


4.1 


259 


63 


1.9 


100661 


BE623001 


Hs.132748 


Homo sapiens ribosomal protein L39 mRNA, 


3.3 


116 


36 


2.2 


100666 


L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


8.5 


85 


1 


3.2 


100667 


L05424 


Hs.169610 


CD44 antigen (homing function and Indian 


3 


594 


201 


2.3 


100745 


BE207168 


Hs.144630 


nuclear receptor subfamily 2, group F, m 


5 


82 


17 


0.9 


100774 


J05581 


Hs.89603 


mucin 1, transmembrane 


3.5 


37 


11 


2.8 


100783 


AF078847 


Hs.191356 


general transcription factor IIH, polype 


9.7 


97 


10 


7.2 


100821 


M26460 




gb:Homo sapiens (clone 104) retinoblasto 


3.3 


33 


1 


0.8 


100864 


BE563957 


Hs.74861 


activated RNA polymerase II transcriptio 


3.7 


477 


130 


3.1 


100877 


X80821 


Hs.27973 


KIAA0874 protein 


6.3 


63 


4 


5.7 


100892 


BE245294 


Hs.1 80789 


S164 protein 


4.7 


47 


1 


4.2 


101038 


BE297139 


Hs.79411 


replication protein A2 (32kD) 


3.8 


115 


30 


7.1 


101046 


K01160 




NM_002122:Homo sapiens major histocompat 


3.9 


390 


100 


11.1 


101079 


BE264901 


Hs.250502 


carbonic anhydrase VIII 


3.9 


39 


8 


3.6 
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101084 


AW409934 Hs.75528 


nucleolar GTPase 


4.1 


53 


13 


4 




101104 


AW862258 Hs. 169266 


neuropeptide Y receptor Y1 


15.3 


153 


1 


14.1 




101185 


NM_001621Hs.17OO87 


aryl hydrocarbon receptor 


11.3 


113 


8 


3.9 




101188 


L20320 Hs 184298 


cyclin-dependent kinase 7 (homolog of Xe 


3.1 


118 


38 


2 


5 


101201 


L22524 Hs.2256 


matrix metalloproteinase 7 (MMP7; uterin 


8.2 


396 


48 


0.9 




101232 


AU077288 Hs.242894 


ADP-ribosylation factor-like 1 


4 


110 


28 


10.7 




101275 


BE545777 Hs 3273 

ULvlvLl # 1 IO.O£.f O 


Ts translation plonnatinn factor mitoch 


4.2 


50 


12 ■ 


4.4 




101300 


BE535511 Hs.74137 


transmernbrane trafficking protein 


6.6 


135 


21 


13.1 


10 


101396 


BF267931 Hs 78996 


proliferating cell nuclear antigen 


6.4 


249 


39 


22.4 


101447 




nh'Hnman alnha 93tpllitp and ^atpllitp ^ 

UU,| lull IQI 1 CallJI Id •SQldllLv ul IU SCILdllLC \J 


6.5 


878 


135 


0.8 




101448 


IMIvl V/w V » t. 1 1 IO. 1 C7v/Uv/v 


kpratin 5 fpnirfprnnolvsis bullosa simnlpx 


4.8 


622 


130 


0.7 




101470 


NM 000"54fiHs 1846 

I11VI VUvOtUI IO. IUtU 


tumor nrntpin n5^ (\ i-Fraumpni svndromp^ 


5.1 


97 


19 


9.3 




101478 


NM 002890HS.758 


RAS p21 protein activator (GTPase activa 


9.6 


96 


1 


8.5 


15 

JL <J 


101484 


AA051486 Hs 20315 


infprfpron -induced nrotein with tetratri 


11.2 


112 


8 


5.9 


101*507 


X16R96 Hsfi511? 


intprlpnkin 1 rpopntnr tvnp I 


3.9 


39 


2 


3.5 




101621 


BE391804 Hsfi5R61 
oi^oo lout ns.u^uu i 


ni lanulafp hind inn nrntpin 1 infprfpron- 

U,UGM tjMCHC UlllUlliy piULCIII l| IlllClldWII 


3.6 


36 


1 


2.6 




101624 


IVU JOOU 


nh'Hnman alnha-1 collanpn tvnp 1 Gpnp 3 

mU-i luuiaii alalia i VyVJiiciy 01 1 i y^i %/ 


3.1 


2898 


923 


2.2 


I - 


101664 


AA436989 Hs121017 


H9A histonp femilv mpmhpr A 


6.9 


103 


15 


8.4 




101684 


M63256 Hs.75124 


cerebellar degeneration-related protein 


6.4 


64 


2 


4.9 




101724 


L11RQ0 Hsfi50 


hiitlons npmnhinnid antinpn 1 ^9^n/940kD^ 

UUINJUO vvll il/l IMVM allUMvll 1 l&UUS^TUrvL// 


9.4 


94 


1 


0.3 




101754 


S70114 Hs 939489 


TIA1 rvfotoxir nraniilp-a^onafpd RNA-hi 


8.9 


89 


5 


8 


71. 

y l 


1017R7 


Mfi1 057 He 1 fl0ftft4 

IVIO lUUf Hi. IOUOO*r 


parhnYunpntiHacp R1 /tiQ.cirp\ 
i^ai uuAyfjtJijLUJaDt; u i yuooucy 


o.u 


824 


227 


1.4 




101791 


Mft"3899 H«5 R9^54 

lVIUOU£.£ llO.U^OOt 


ppII diui<«inn rvrip 4-Iikp 

LfCil UIVIOIwll UjfUlO ~ IIFNC 


9 


144 


16 


13 


LiL; 


101794 


Mft4fi05 H*? Q57 
iviotouo no.jji 


nutatiwo rtniniri rprpntnr npiirnmpdin W ( 


3.3 


36 


11 


2.4 




1(118(11 
1 U 1 ouo 


A\A/0941Q0 He 155fiQ1 
f\VVUt40i7U no. lOOOiJ 1 


pic-D-vcll lcUr.cn lid Ual lolrllUUUH laOtUI 


5 4 


180 




15.9 


™% i 




Mftfift4Q H«; ?9^Ti3 


nan iunrtion nrofpin hpfa 9 9fikD foonn 

uou jui luiiui I fjiuidit, isszua t-\> r,u yvuiiii 


12 


120 


8 


9 




I U 1 OOo 


AA44RR44 H<* fiQ9 


f^A7T^_9 antinpn 1 pnifhplisl nlu/vinrrttpin 


3.1 


353 


116 


2.8 




miftftfl 


Al 04QR10 Hq Q5943 

r\LU*rao IU no.OOZrTO 


tranQr"rintinn plnnnatinn faH"rir A /QIIY. 

UCtl IOW IUUUII ClUIIUclUUI I IdLrLUl f \ \OM j 


7 3 


73 


-| 


5.3 




ioiqro 

IU I sou 


Al 0^fi9ft7 H«s 1Q4fifi9 


UCliJJUl llll O, aWUlU 


? ft 


"199 


105 


3.3 


Jao 




RF945149 hk 89643 


nrnfpin tvmQinp kinpe.P Q 

[JIUlClll lyivJOIIIC Ml IQOC o 


4.6 


148 


32 


11.3 






1111^1? H<s 75760 
ui itjio no. / 0 / uu 


cfoml parripr nrntpin 9 

OlCIUi IJCIillCI UIUICIH 


Q 5 
o.o 


95 


4 


8.8 




10919"* 

IU<C ItO 


MM OO1fi0QH<i 15Q4 
nivi_uu iouynb. ioo*r 


Lie I lUUIIiclC UIULCEI) r\ \ 1 / MJ) 


4 9 


49 


7 






I KiC t £0 


MM 00fi45fil-k 9ftft915 

INIVI_UUU*iOOno.tOOZ 13 


OfaiyiUdl lOlclaOc 


o.o 


93 


4 


3 




10911Q 


NM 00441QI-k 919ft 


UUal ojJcL-iiiwiy Ui luopi laiaoc o 


5 4 


1^7 

IO/ 


2fi 


2.5 


§5 


1091R9 


A A450974 He 1 5Q9 

rvvTOUZ/*? no. 


r*nr*1R (ro\\ Hiuicinn rvrlp 1R Q rprpui 
OUu IU yuCII UIVIOIUM uyvsic IU, O. UCICVI 


4.6 


151 

IO 1 


33 


2 




1091R5 


RC5119PO We 15G£97 
DCO loZOU MS. lOiJOZ/ 


death associated protein 3 


Q ^ 


yo 


c 
3 


0 

o 




\\J£. loO 


Al MfiM5 ^11 
r\i_uovooj ns.o 10 


OCUclCU UIIUOyilUUIULCIII 1 ^UalCUUVIIUIl, 


45 7 


457 
to/ 


1 


39.7 




1 0991 1 


SCIi 4594 He 7ft77fR 


putative transmembrane protein 


^ Q 


449 




1 1 


40 


109991 


MM 00R7RQH« 1ft44 


L.UVI UUIIIdlll uiuy H 


4 Q 


4Q 


i 
l 


1 fi 


109949 


1 I971A5 He A9547 


retinoic acid receptor responder (tazaro 


T 1 
0. I 


^1 
O 1 


1 
I 


1 -O 




10995ft 

I UZZ.OO 


MM 00154fiHe 14ft51 


inhlhitAr rii ONA hindinn 4 dominant ncn 
IIIIIIUILUi Ul U\\r\ Ulliuniy *t, UUMIIIIalll iicy 


^ ft 

O.O 


1UO 


43 


0.5 




1 09^09 


AA'?0f;^49 UefiQ171 


protein kinase C-like 2 


4 5 


45 


\ 


O.O 




1 09**04 
I UZOUn 


AP015994 Hc4 , R459 


INdlllllldy lUUIII I 


ft 5 

O.O 


905ft 




1.4 


45 


lUZo*tO 


I H751Q He ft751Q 

Vol 0 iy ns.oY ooy 


aldehyde dehydrogenase 3 family, member 


R 4 


49ft 


R7 
Or 




1 09^RQ 


1 I10A40 Uc 9QQftfi7 


hepatocyte nuclear factor 3, alpha 


fi 7 
0./ 


R7 


Q 


O.O 




1 09407 


A\A/R09154 He ft914^ 


c/*r-iiM? lacior £. [pis uomain uanscnpi 


5 

O.O 


5? 
oo 


1 
1 






1 0940Q 


RP100^<10 He 1 1ft795 


seienopnospnaie synineiase z 


0.0 


111 
I I I 


14 
0*t 


7 5 
/ .o 




109457 


MM OOno^He. 9Wi 


dual enpfifir-iiv nhr\cnhataco 4 
uudi opcuitiuiiy fjiiuopiiaLctoc h 


90 9 


909 


K 


1.3 


50 


IUZO*W 


n ivi_uu jy o / n s . i Oo i oy 


kynureninase (L-kynurenine hydrolase) 


"K ft 
0.0 


OO 




1 s 
I. J 


lUlOOf 


UojoJU nS.14Do4/ 


TRAF family member-associated NFK3 activ 


ft 9 


R9 
OZ 


1 


fi ft 
0.0 




1095ftO 


I IROftOfl He 159Qft1 


v-<ur-ti lacyigiyceroi syninase ^pnospnauua 


4 1 


41 
H I 


\ 
1 


O.O 




109R1A 
1 UZ.U 1 o 


Al 017R79 He ft1071 

ni.UO/DlZ no.O Ivi 1 


OYtrapolli ilar matriy nrntpin 1 


10 9 


R9ft 

\}£a3 


62 


17.2 




102638 


U67319 Hs.9216 


caspase 7, apoptosis-related cysteine pr 


5 


66 


13 


5.3 


55 


102663 


NM 002270HS.168075 


karyopherin (importin) beta 2 


6.1 


126 


21 


2.4 


102669 


U71207 Hs.29279 


eyes absent (Drosophila) homolog 2 


4.5 


45 


1 


2.8 




102742 


U79293 Hs.159264 


Human clone 23948 mRNA sequence 


4.1 


41 


1 


2.4 




102784 


U85658 Hs.61796 


transcription factor AP-2 gamma (acttvat 


4.4 


255 


58 


1.6 




102805 


U90304 Hs.25351 


iroquois homeobox protein 5 


3.6 


142 


39 


1.6 


60 


102813 


BE242035 Hs.151461 


embryonic ectoderm development 


3.5 


35 


1 


2.7 


102823 


D85390 Hs.5057 


caruoxypeptkJase D 


5.6 


56 


1 


5.3 




102825 


BE262386 Hs.7137 


clones 23667 and 23775 zinc finger prate 


4.2 


42 


7 


3.7 




102899 


AI815559 Hs.75730 


signal recognition particle receptor ('d 


3.2 


58 


18 


5 




102913 


NM 002275HS.80342 


keratin 15 


5.8 


753 


131 


0.4 


65 


102927 


BE512730 Hs.65114 


keratin 18 


3.1 


815 


266 


1.7 


102961 


AL1 19505 Hs.198166 


activating transcription factor 2 


3.2 


32 


4 


2.6 




102968 


AU076611 Hs.154672 


methylene tetrahydrofolate dehydrogenase 


5.7 


251 


44 


6.6 




103003 


AI910275 Hs.1406 


trefoil factor 1 (pS2) 


5.6 


1346 


239 


5.4 
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110915 BE092285 Hs.29724 hypothetical protein FLJ13187 20.9 

110930 BE242691 Hs.14947 ESTs 3.4 

110970 Y19062 Hs.96870 staufen (Drosophila, RNA-binding protein 3.5 

111084 H44186 Hs.15456 PDZ domain containing 1 4.3 

111125 N63823 Hs.269115 ESTs, Moderately similar to Z1 95_H UMAN Z 5.4 

111132 AB037807 Hs.83293 hypothetical protein 7.2 

111164 N46180 Hs.122489 Homo sapiens cDNA FU13289 fis, clone OV 7.7 

111179 AK000136 Hs.10760 asporin(LRR class 1} 25.1 

111184 AI815486 Hs.243901 Homo sapiens cDNA FU20738 fls, clone HE 3.9 

111190 AK002055 Hs.151046 hypothetical protein FU1 1193 6.3 

111221 AB037782 Hs.15119 KIAA1361 protein 3.7 

111223 AA852773 Hs.334838 KIAA1 866 protein 3.6 

111229 AW389845 Hs.110855 ESTs 4.3 

111234 AA902656 Hs.21943 NIF3(Ngg1 interacting factor 3, S.pombe 3.3 

111241 AA345644 Hs.288880 PAN2 protein 4.8 

111345 AW263155 Hs.14559 hypothetical protein FU10540 4.3 

111353 W20090 Hs.6616 ESTs 4.1 

111357 BE314949 Hs.87128 hypothetical protein FLJ23309 3.8 

1 1 1 378 AW1 60993 Hs.326292 hypothetical gene DKFZp434A1 114 4.3 

111389 AK000987 Hs.169111 oxidation resistance 1 3.4 

111540 U82670 Hs.9786 zinc finger protein 275 3.5 

111806 BE071382 Hs.279008 hypothetical protein FLJ20170 3.5 

111884 AW502285 Hs.127236 hypothetical protein FLJ12879 3.2 

111923 BE383234 Hs.25925 Homo sapiens, clone MGC: 15393, mRNA.com 6.2 

111929 AF027208 Hs.112360 prominin (mouse)-like 1 8.1 

111942 R40576 Hs.21590 hypothetical protein DKFZp564O0523 4.2 

111987 NM_015310Hs.6763 KIAA0942 protein 6.5 

112092 R44538 gb:yg29c02.s1 Soares infant brain 1 NIB H 3.3 

112134 R41823 Hs.7413 ESTs; calsyntenin-2 6.1 
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112198 AI432672 Hs.288539 hypothetical protein FLJ221 91 3.5 
112244 AB029000 Hs.70823 KIAA1 077 protein 5.7 
112253 R51818 gb:yg77h12.s1 Soares infant brain 1 NIB H 4 
112269 R53734 Hs.25978 ESTs, Weakly similar to 2109260A B cell 3.7 
112275 AW972635 Hs.301904 hypothetical protein FLJ 12671 4.3 
112280 AA863360 Hs.26040 ESTs, Weakly similar to fatty acid omega 2.8 
112305 AK000914 Hs.26244 hypothetical protein FLJ10052 3.5 
112483 AW969785 Hs.285885 Homo sapiens cDNAFLJ1 1321 fis, clone PL 4.2 
112513 R68425 Hs.13809 hypothetical protein FLJ10648 4.7 
112571 AA412205 Hs.140996 ESTs 4.8 
112971 Z42387 Hs.83883 transmembrane, prostate androgen induced 4.5 
113023 AL134324 Hs.7312 ESTs 3.2 
113047 AI571940 Hs.7549 ESTs 9.6 
113073 N39342 Hs.103042 microtubule-associated protein 1B 9.1 
113083 AA283057 Hs.266957 hypothetical protein FLJ14281 6.5 
113287 T66847 Hs.194040 ESTs, Weakly similar to I38022 hypotheti 3.5 
113296 AW449560 Hs.89576 inner mitochondrial membrane peptidase 2 3.5 
113523 AI791905 Hs.95549 hypothetical protein 7.6 
113604 AI075407 Hs.296083 ESTs, Moderately similar to I54374 gene 3.1 
113617 AI869372 Hs.17207 Homo sapiens CDNAFLJ1 1922 fis, clone HE 3.6 
113702 T97307 gb:ye53h05.s1 Soares fetal liver spleen 12.3 
113783 AL359588 Hs.7041 hypothetical protein DKFZp762B226 4.6 
113791 AI269096 Hs.135578 chitobiase, di-N-acetyl- 3.6 
113794 T62849 Hs.11090 membrane-spanning 4-domains, subfamily A 3.3 
113804 BE247683 Hs.14611 dual specificity phosphatase 1 1 (RNA/RNP 3.3 
113808 W44735 Hs.9286 Homo sapiens cDNA: FLJ21278 fis, clone C 5.1 
113847 NM_005032Hs.4114 plastjn 3 (T isoform) 3.2 
113849 AA457211 Hs.8858 bromodomain adjacent to zinc finger doma 4.3 
113867 AW002834 Hs.24095 ESTs 6.1 
113886 W76027 Hs.23920 hypothetical protein FLJ11105 4 
113923 AW953484 Hs.3849 hypothetical protein FLJ22041 similar to 3.7 
113936 W17056 Hs.83623 nuclear receptor subfamily 1, group I, m 4.3 
113950 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 10.7 
114030 AI825386 Hs.164478 hypothetical protein FLJ21 939 similar to 4.4 
114051 AB026436 Hs.177534 dual specificity phosphatase 10 4.5 
114057 AF1 16653 Hs.34192 Homo sapiens PRO0823 mRNA, complete cds 3.5 
114082 AK001612 Hs.26962 Homo sapiens cDNA FU 10750 fis, clone NT 3.1 
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131941 


BE252983 Hs.35086 


ubiquitin specific protease 1 


7.4 


103 


14 


6.5 


55 


131945 


NM 002916HS.35120 


replication factor C (activator 1) 4 (37 


3.7 


37 


1 


3.4 


131949 


AK000010 Hs.258798 


hypothetical protein FLJ20003 


3.5 


35 


1 


2.5 




131965 


W79283 Hs.35962 


ESTs 


5.5 


168 


31 


4.4 




131977 


U90441 Hs.3622 


procollagen-proline, 2-oxoglutarate 4-di 


3.7 


37 


9 


2.8 




131985 


AA503020 Hs.36563 


hypothetical protein FLJ22418 


40.2 


402 


1 


4 


60 


131993 


AI878910 Hs.3688 


cisplatin resistance-associated overexpr 


7.3 


73 


1 


1.2 


132064 


AA121098 Hs.3838 


serum-inducible kinase 


22.6 


226 


10 


0.9 




132094 


NM 016045HS.3945 


CGM07 protein 


3.1 


227 


73 


16.8 




132109 


AW190902 Hs.40098 


cysteine knot superfamily 1, BMP antagon 


3.5 


73 


21 


6.3 




132116 


AW960474 Hs.40289 


ESTs 


3.6 


141 


39 


12.6 


65 


132143 


D52059 Hs.7972 


KIAA0871 protein 


4.9 


49 


1 


4.1 


132160 


W26406 Hs.295923 


seven in absentia (Drosophila) homoiog 1 


4.4 


53 


12 


2.1 




132164 


AI752235 Hs.41270 


procollagen-lysine, 2-oxoglutarate 5-dio 


5 


225 


45 


9.1 




132180 


NM 004460HS.418 


fibroblast activation protein, alpha 


10.7 


433 


41 


7.2 
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132197 AI699482 Hs.42151 
132256 AI078645 Hs.431 
132298 NM_015986Hs.7120 
132316 U28831 Hs.44566 
132325 N37065 Hs.44856 
132358 NM_003542Hs.46423 
132384 AA312135 Hs.46967 
132388 W32624 Hs.278626 
132393 AL135094 Hs.47334 
132407 BE613126 Hs.47783 
132425 N87549 Hs.1 25287 
132440 AB020699 Hs.1 12751 
132465 AW169847 Hs.49169 
132522 AB023164 Hs.5070 
132528 T78736 Hs.50758 
132530 M306105 Hs.50785 
132543 BE568452 Hs.5101 
132572 AI929659 Hs.237825 
132592 AW803564 Hs.288850 
132602 AW606927 Hs.5306 

132616 BE262677 Hs.283558 

132617 AF037335 Hs.5338 

132618 AL050025 Hs.279916 
132632 AU076916 Hs.5398 
132668 AB018319 Hs.5460 
132742 AA025480 Hs.292812 
132790 AW242243 Hs.168670 
132811 U25435 Hs.57419 
132852 AL120050 Hs.58220 
132856 NM_001448Hs.58367 
132880 BE077155 Hs.177537 
132902 AI936442 Hs.59838 
132906 BE613337 Hs.234896 
132914 AL047045 Hs.60293 
132968 AF234532 Hs.61638 
132977 AA093322 Hs.301404 
132990 X77343 Hs.334334 
132994 AA112748 Hs.279905 
133011 NM_006379Hs.171921 
133015 AJ002744 Hs.246315 
133070 U92649 Hs.64311 
133091 AK001628 Hs.64691 
133192 AA218564 Hs.67052 
133197 AI275243 Hs.180201 
133199 AF231981 Hs.250175 
133221 W32474 Hs.301746 
133240 AK001489 Hs.242894 
133271 Z48633 Hs.283742 
133291 BE297855 Hs.69855 
133294 AJ001388 Hs.69997 
133350 AI499220 Hs.71573 
133362 AK001519 Hs.7194 
133370 AF245505 Hs.72157 
133407 AF017987 Hs.7306 
133422 AB033061 Hs.73287 
133435 AI929357 Hs.323966 
133479 W01556 Hs.238797 
133493 AW998046 Hs.194369 
133504 NM_004415Hs.74316 
133517 NM_000165Hs.74471 
133536 W25797 Hs.177486 
133578 AU077050 Hs.75066 
133633 D21262 Hs.75337 
133640 AW246428 Hs.75355 
133669 NM_006925Hs.166975 
133681 AI352558 Hs.75544 
133746 AW410035 Hs.75862 



ESTs 

murine leukemia viral (bmi-1) oncogene h 

cytokine receptor-like molecule 9 

KIAA1641 protein 

hypothetical protein FLJ12116 

H4 histone family, member G 

HSPC034 protein 

Arg/Abl-interacting protein AigBP2 

hypothetical protein FLJ14495 

B aggressive lymphoma gene 

zinc finger protein ZNF140-like protein 

KIAA0892 protein 

KIAA1634 protein 

KIAA0947 protein 

SMC4 (structural maintenance of chromoso 
SEC22, vesicle trafficking protein {S. c 
protein regulator of cytokinesis 1 
signal recognition particle 72kD 
Homo sapiens cDNA: FLJ22528 fis, clone H 
hypothetical protein DKFZp586F1122 simil 
hypothetical protein PR01855 
carbonic anhydrase XII 
hypothetical protein FLJ20151 
guanine monphosphate synthetase 
KIAA0776 protein 

ESTs, Weakly similar to T33468 hypotheti 
peroxisomal famesylated protein 
CCCTC-binding factor (zinc finger protei 
Homo sapiens cDNA: FU23005 fis, clone L 
glypican 4 

hypothetical protein DKFZp761B1514 
hypothetical protein FLJ10808 
geminin 

Homo sapiens clone 122482 unknown mRNA 
myosin X 

RNA binding motif protein 3 

transcription factor AP-2 alpha (activat 

clone HQ0310PRO0310p1 

sema domain, immunoglobulin domain (Ig), 

UDP-N-acetyl-alpha-D-gatactosamine:polyp 

a disintegrin and metalloproteinase doma 

KIAA0483 protein 

vacuolar protein sorting 26 (yeast homol 
hypothetical protein FLJ20671 
homolog of yeast long chain polyunsatura 
RAP2A, member of RAS oncogene family 
ADP-ribosylation factor-like 1 
H.sapiens mRNA for retrotransposon 
NRAS-related gene 
zinc finger protein 238 
hypothetical protein FLJ 10074 
CGI-74 protein 
DKFZP564I1922 protein 
secreted frizzled-related protein 1 
KIAA1235 protein 

Homo sapiens clone H63 unknown mRNA 

ESTs, Moderately similar to I38022 hypot 

arginine-glutamic acid dipeptide (RE) re 

desmoplakin (DPI, DPII) 

gap junction protein, alpha 1, 43kD (con 

amyloid beta (A4) precursor protein (pro 

translin 

nucleolar and coiled-body phosphprotein 
ubiquitin-conjugating enzyme E2N (homolo 
splicing factor, arginine/serine-rich 5 
tyrosine 3-monooxygenase/tryptophan 5-mo 
MAD (mothers against decapentaplegic, Dr 
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133765 


M62194 Hs.75929 


cadherin 1 1 , type 2, OB-cadherin (osteob 


3.2 


560 


174 


2.6 




133780 


M557660 Hs.76152 


decorin 


5.4 


144 


27 


13.3 




133784 


BE622743 Hs.301064 


arfaptin 1 


4.7 


47 


1 


4.1 




133814 


NM_002462Hs.76391 


myxovirus (influenza) resistance 1, homo 


3.3 


380 


114 


4.9 


5 


133829 


AW630088 Hs.76550 


Homo sapiens mRNA; cDNA DKFZp564B1264 (f 6.7 


304 


46 


7.8 




133845 


AA1 47026 Hs.76704 


ESTs 


6.2 


600 


97 


4.1 




133913 


AU076964 Hs.7753 


calumenin 


3.3 


889 


267 


5 




133968 


AA355986 Hs.232068 


transcription factor 8 (represses inter! 


3.7 


91 


25 


2.6 


10 


133990 


R48316 Hs.7822 


Homo sapiens mRNA; cDNA DKFZp564C1216 (f 3.4 


91 


27 


8.5 


133999 

■ w www w 


AA535244 HS7830") 


RAB2, member RAS oncogene family 
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78 
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BE513171 Hs.79086 


mitochondrial ribosomal protein L3 
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protease, serine, 15 
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erbb2-interacting protein ERBiN 
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fatty acid synthase 
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cathepsin K (pycnodysostosis) 
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HUEL (C4orf1)-interacting protein 
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41 
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NM nf)2884Hs 865 


RAP1 A, member of RAS oncogene family 
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AW0RR993 Hs171<i81 


ubiquitin C-terminal hydrolase UCH37 
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bone morphogenetic protein receptor, typ 
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hypothetical protein FU10879 
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cytochrome b-245, beta polypeptide (chro 


3.2 
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45 
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AF129536 Hs.284226 


F-box only protein 6 


7 


70 


6 


6 




134856 


BE281128 Hs.9030 


TONDU 


3.1 


31 


1 
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134880 


AI879195 Hs.90606 


15 kDa selenoprotein 


5.7 


57 
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134917 


X87241 Hs.166994 


FAT tumor suppressor (Drosophila) homolo 


3.2 


153 


48 


4.7 
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AL137491 Hs.125511 


Homo sapiens mRNA; cDNA DKFZp434P1530 (f 4 


452 


114 


2 


134982 


AK002085 Hs.92308 


Homo sapiens cDNA FLJ1 1223 fis, clone PL 
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134989 


AW968058 Hs.92381 


nudix (nucleoside diphosphate linked moi 
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114 


14 
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135029 
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hydroxysteroid (17-beta) dehydrogenase 7 
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115 
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AL034344 Hs.284186 


forkhead box CI 
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1.4 
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AI272141 Hs.83484 


SRY (sex determining region Y)-box 4 


3.3 


1296 


394 
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135062 


AK000967 Hs.93872 


KIAA1682 protein 


3.8 


240 
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135073 


W55956 Hs.94030 


Homo sapiens mRNA; cDNA DKFZp586E1624 (1 
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13 


7.9 




135098 


AW274526 Hs.277721 


ovarian carcinoma antigen CA125 


3.3 


33 
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W52493 


Hs.94694 


Homo saDiens cDNA FLJ10561 fis clonp NT 


5.3 


53 


1 


4.1 




135144 


NMJ16255HS.95260 


Autosomal Highly Conserved Protein 


7.4 


74 


5 


2.4 
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AK001835 


Hs.267812 


sorting nexin 4 


6^6 


69 


11 


6.3 




135155 


AI207958 


Hs.166556 


Homo sapiens, Similar to TEA domain fami 


6.1 


61 
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5.1 




135172 
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Hs.12144 


KIAA1033 protein 


3.4 


RR 


26 


1.4 




135242 


AI583187 


Hs.9700 


cyclin E1 


3.1 


31 
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135243 


BE463721 


Hs.97101 


putative G protein-coupled receptor 


3.4 


169 


50 


9.1 




135269 


NM_003403Hs.97496 


YY1 transcription factor 


3.4 










135356 


BE312948 


Hs.18104 


hypothetical protein FLJ11274 


3.1 


O 1 


1ft 


1.7 




135357 


AI565004 


Hs.79572 


cathepsin D (lysosomal aspartyl protease 


4.7 


710 


151 


2.5 




135389 


U05237 


Hs.99872 


fetal Alzheimer antigen 


20.6 


206 


4 


19.1 




135397 


L14922 


Hs.166563 


replication factor C (activator 1)1(14 


3.2 


32 


1 


2.4 




135400 X78592 


Hs.99915 


androgen receptor (dihydrotestosterone r 


3.2 


117 


37 


9.4 






AI471525 


Hs.247486 


ESTs 


3.8 


58 


16 


5.5 






X70683 


Hs.93668 


ESTs 


1.8 


1047 


596 


1.6 






L14922 


Hs.82128 


5T4 oncofetal trophoblast glycoprotein 


5 


285 


58 


1.2 






M23263 


Hs.904 


amylo-1 ;6-glucosidase; 4-alpha-glucanotransferas 


3.1 


31 


1 


2.6 




AI267886 


Hs.148027 


polymerase (RNA) II (DNA directed) polypeptide B 


7.8 


137 


18 


11.9 




AA044840 


Hs.241676 


stromal cell-derived factor 1 


4.7 


114 


25 


0.9 






N90960 


Hs.227459 


ESTs; Moderately similar to III! ALU SUBFAMILY 


4.7 


151 


32 


9.3 




AA873285 


Hs.137947 


ESTs 


4.7 


47 


3 


4.4 






T56679 


Hs.865 


RAP1A; member of RAS oncogene family 


4 


40 


1 


3.4 






AA305536 




"EST1 76522 Colon carcinoma (Caco-2) cell line 


I 


3.6 


121 


34 


11.8 




AI369384 




arylsulfatase D 


3.5 


113 


33 


1.7 






AA219081 


Hs.242396 


ESTs; Moderately similar to !!!! ALU SUBFAMILY] 


3.4 


107 


32 


9.9 
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TABLE 10A 



Table 10 A shows the accession numbers for those pkeys lacking unigenelD's for Table 10. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 


CAT number 


Accession 


123619 


371681 1 


AA602964AA609200 


104602 


524482 2 


H47610 R86920 


121581 


283769 1 


AA416568 AA442889 M417233 AA442223 


123523 


qenbank_AA608588 AA608588 


100821 


tigr_HT4306 


M26460U09116 


125091 


genbank_T91518 


T91518 


125150 


NOT_FOUND_entrez_W38240 W38240 


118475 


genbank_N66845 


N66845 


104787 


genbank_AA027317 


AA027317 


106055 


genbank_AA417034 


AA417034 


113702 


genbank T97307 


T97307 


101046 


entrez_K01160 


K01160 


101447 


entrez_M21305 


M21305 


101624 


entrez_M55998 


M55998 


124677 


genbank_R01073 


R01073 


110581 


genbank_H61560 


H61560 


119023 


genbank_N98488 


N98488 


110775 


genbank_N22414 


N22414 


112092 


genbank_R44538 


R44538 


112253 


genbank_R51818 


R51818 


107014 


genbank_M598820 


AA598820 


114988 


genbank_AA251089 


AA251089 
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as 



35 



40 



45 



50 



55 



60 



TABLE 11: Figure 11 from BRCA 001-3 PCT 

Table 11 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal body tissue 

M; R2: Ratio of 90* percentile tumor to normal body 

ft 5 R3: Ratio of 75* percentile normal body to tumor 

~! R4: Ratio of tumor to normal breast tissue 



Pkey 


ExAccn UnigenelD Unigene Title 


R1 


R2 


R3 


R4 


100131 


D12485 Hs. 11951 ectonucleotidepyrophosphatase/phosphodi 


13.2 


244 


19 


9.9 


100147 


D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 


15.7 


1030 66 


5 


100522 


X51501 Hs.99949 prolactjn-induced protein 


22.7 


760 


34 


1.4 


100656 


L05424 Hs.169610 CD44 antigen (homing function and Indian 


8.5 


85 


1 


3.2 


101104 


AW862258 Hs.169266 neuropeptide Y receptor Y1 


15.3 


153 


1 


14.1 


101478 


NM_002890Hs.758 RAS p21 protein activator (GTPase activa 


9.6 


96 


1 


8.5 


101724 


L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 


ft A 

9.4 


94 


1 


U.O 


101754 


S70114 Hs.239489 TIA1 cytotoxic granule-associated RNA-bi 


8.9 


89 


5 


8 


101888 


AL049610 Hs.95243 transcription elongation factor A (Sll)- 


7.3 


73 


1 


5.3 


102165 


BE313280 Hs.159627 death associated protein 3 


9.3 


93 


5 


8 


102304 


AF015224 Hs.46452 mammaglobin 1 


8.5 


2058 243 


1.4 


102348 


U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 


6.4 


428 


67 


2.3 


102457 


NM_001394Hs.2359 dual specificity phosphatase 4 


20.2 


202 


5 


1.3 


102567 


U63830 Hs.146847 TRAF family member-associated NFKB activ 


8.2 


82 


1 


6.8 


102823 


D85390 Hs.5057 carboxypeptidase D 


5.6 


56 


1 


5.3 


103557 


AL133415 Hs.297753 vimentin 


7.5 


136 


18 


3.4 


103613 


NM_000346Hs.231 6 SRY (sex determining region Y)-box 9 (ca 


7.3 


73 


1 


5.2 


104115 


AF183810 Hs.26102 opposite strand to trichorhinophalangeal 


29 


290 


1 


26.8 


104667 


AI239923 Hs.30098 ESTs 


14.9 


149 


1 


6.4 


104804 


AI858702 Hs.31803 ESTs, Weakly similar to N-WASP (H.sapien 


7.7 


77 


1 


5.1 


104807 


AI139058 Hs.125790 leucine-rich repeat-containing 2 


7 


70 


1 


6.5 


104896 


AW015318 Hs.23165 ESTs 


7.4 


74 


1 


6 


104943 


AF072873 Hs.114218 frizzled (Drosophila) homolog 6 


16.2 


162 


1 


4.2 


105038 


AW503733 Hs.9414 KIM1488 protein 


5.5 


55 


1 


5.2 


105329 


M234561 Hs.22862 ESTs 


2.8 


131 


47 


3.9 


105500 


AW602166 Hs.222399 CEGP1 protein 


25.4 


508 


20 


3 


105516 


AK001269 Hs.30738 hypothetical protein FU10407 


8.3 


83 


3 


1.8 


105730 


AW377314 Hs.5364 DKFZP564I052 protein 


6.9 


69 


1 


4.4 


106012 


AI240665 Hs.8895 ESTs 


21.2 


212 


6 


17.4 


106095 


AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 


26.3 


356 


14 


1 


106155 


AA425414 Hs.33287 nuclear factor l/B 


9.9 


483 


49 


1.8 


107102 


AB037765 Hs.30652 KIAA1 344 protein 


6.3 


63 


1 


5.4 


107136 


AV661958 Hs.8207 GK001 protein 


2.5 


392 


155 


4.3 


107151 


AW378065 Hs.8687 ESTs 


15.6 


156 


7 


10.8 


107922 


BE153855 Hs.61460 Ig superfamily receptor LNIR 


9 


90 


1 


5.5 


108339 


AW151340 Hs.51615 ESTs, Weakly similar to ALU7 HUMAN ALU S 


18.7 


187 


1 


17 


109112 


AW419196 Hs.257924 hypothetical protein FU13782 


4.1 


334 


82 


3.4 


109292 


AW975746 Hs.188662 KIAA1702 protein 


7.1 


71 


1 


6.5 


109415 


U80736 Hs.1 10826 trinucleotide repeat containing 9 


12.3 


123 


1 


11.3 


109912 


AW390822 Hs.301528 L-kynurenine/alpha-aminoadipateaminotra 


14.2 


142 


1 


9.5 


110009 


BE075297 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 p 


6.3 


693 


110 


7.2 


110915 


BE092285 Hs.29724 hypothetical protein FLJ13187 


20.9 


209 


1 


19.5 


111164 


N46180 Hs.122489 Homo sapiens cDNA FU13289fis, clone OV 


7.7 


77 


1 


5 
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111179 


AK000136 


Hs.10760 asporin (LRR class 1) 


25.1 


288 


12 


6.7 




111190 


AK002055 


Hs.151046 hypothetical protein FU11 193 


6.3 


63 


1 


5.8 




111223 


AA852773 


Hs.334838 KIM1866 protein 


3.6 


402 


112 


4.9 




111357 


BE314949 


Hs.871 28 hypothetical protein FU23309 


3.8 


425 


111 


4 


5 


112244 


AB029000 


Hs.70823 KIM1077 protein 


5.7 


567 


100 


6.7 




113047 


AI571940 


Hs.7549 ESTs 


9.6 


124 


13 


9 




113702 


T97307 


gb:ye53h05.s1 Soares fetal liver spleen 


12.3 


129 


11 


11.7 




1 14124 


W57554 


Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 


24.2 


242 


10 


5.6 


10 


114118 

1 It I oo 




Hs.15740 Homo sapiens mRNA; cDNA DKFZp434E033 (fr 6.7 


67 




6.3 


114768 


AF2 12848 


He IflOQQQ <stc hnmnlnnni iq fartnr 


■n 7 

f O.f 


137 


1 


8.9 




114860 


AL157545 


He A917Q hi"Ami*vlrtmain anri PHn finnor fYintaininn ^ 
no.H-z i / y uiuiiiuuuiiiditi ctiiu r ml/ hi ty ci uuiiuaitiuiy, o 


Q 1 


91 


1 


7.6 




1 14965 


AI733881 


He 79479 RIUIP R1R 


otz q 

00.3 


359 


10 


29.7 




114988 




nh"7cftAfrm c1 M^l Pf5AP fiPR1 Hnmn eani^rv 
yu.cowHluO.o I iNul^uUnr^uo I nuinu oaUieu, 


5 1 1 .O 


115 


1 


6.9 


15 


115206 


AWIRIfiQ 1 ) 
nil iujujj 


He -1flfi*79 CQTe 


o.O 


58 


1 


5 


115719 




Ho 5GR99 Mnmn eanionc Hnno tMAftF '^07981 mRNA 

ns.jyozz noiTio sapiens, cione iiviaocoou* zo i, uiimmm, 


7 fi 


144 


19 


13.9 




115844 




ns.oozyoo nypouieucai protein ivioodj/u 


D.Z 


62 


1 


5.4 




116470 


AI272141 


ns.oiMOH okt isex aeienriinirig region j)-doxh 


1.0 


1047 


596 


1.6 




116786 


H25836 


Hs.301527 ESTs, Moderately similar to unknown [H.s 


zz.o 


228 


9 


12.4 


Las. 


117280 


M18217 


He 17919Q Hnmn canifsnQ rfSMA* FI l91dflQ fie. Hfme> C 
no. i f z izy nut mu odfJieiio vL/imm. ri_j£ i*+yj no, oiuiic o 


O.J 


322 


83 


4.4 


*2b 


11741? 


1 *0£000 


Hs.42645 solute carrier family 1 6 (monocarboxylic 


17 A 


174 


g 


6.9 


H 


118472 


Al WHS 

fAl- 1 Of J*rJ 


Hs.42179 bromodomain and PHD finger containing, 3 


1H.0 


145 


1 


2.4 




11Q971 


MIUU 1 1 IO 


Hs.65328 Fanconi anemia, complementation group F 


ft O 


82 




6.4 




119771 


Aiqnsfia7 

ruou ouui 


Hs.2533 EST 


0.0 


2073 


595 


2.1 




19fWfi9 

1 ZlU JUL 


RF?44'ififl 


Hs.302267 hypothetical protein FLJ10330 


ft R 
0.0 


127 


15 


1.6 


115 


121463 


AKfinnoRo 


Hs.239681 hypothetical protein FLJ20275 


in ^ 


103 




9.3 




19179^ 


AA0414QQ 


Hs.104800 hypothetical protein FLJ10134 


0 0 


214 


74 


3.7 




1 99Qftt 

I ZZJUO 


rtM*+/ 0*t*tO 


Hs.69559 KIAA1096 protein 


7 O 

1 .z. 


72 


\ 


5.7 


Has' 


123137 


AI071Q11 


Hs.100686 ESTs, Weakly similar to JE0350 Anterior 


y.y 


351 


36 


13.9 




I *co □ t y 


M602964 


gb:no97c02.s1 NCI_CGAP_Pr2 Homo sapiens 


ft c 
0.0 


85 


1 


4.3 




19^7flQ 


AA706910 


Hs.112742 ESTs 






16 

1 u 


4 8 


1 : 


i94nnfi 

I ZHUUO 


AI147155 


Hs.270016 ESTs 


5.8 




55 


17 


f=!: 


1 94fi^Q 
I ZtUQy 


BE387335 


Hs.283713 ESTs, Weakly similar to S64054 hypotheti 


1U.4 


Ran 

oou 


oo 


O.O 


F| ; 


1 94 Vtft 


AA249027 


Hs.241507 ribosomal protein S6 


1U.0 


105 


1 


9.9 




19^97Q 
i zyzr y 


AW401809 


Hs.4779 KIAA1 150 protein 


lo.l 


111 


-I 

1 


O. 1 


35 


[ZOO I r 


AA287921 


Hs.164950 ESTs 


C 7 
Q.I 


fi7 

Of 


-j 


g 




IZf 403 


D60237 


Hs.14368 SH3 domain binding glutamic acid-rich pr 


on e 
oU.o 


ins 

ouu 


A 


OR S 

£0.0 


=" ; 


l ZOOUO 


AI954968 


Hs.279009 matrix Gla protein 


( .0 


7"i 
f o 


1 
1 


O.O 




19ft4fi9 


AI694143 


Hs.296251 programmed cell death 4 


7 O 


79 


I 


<i ft 
0.0 


40 


1987Qfi 


AF026692 


Hs.105700 secreted frizzled-related protein 4 


A ~J A 

\ I A 


409 


24 


7.8 


1 9flQ9* 

i zoyzo 


R67419 


Hs.21851 Homo sapiens cDNA FLJ 1 2900 fis, clone NT 


7.1 


1QO 
oyz 


oo 


1 fi 

0.0 




1 zy U 1 1 r 


AA1 15333 


Hs.107968 ESTs 


8.2 






7.4 




19Q99Q 


AF013758 


Hs.109643 polyadenylate binding protein-interacOn 


l.\ 


71 


1 
i 






1 9QTV7 


NM_014918Hs.110488 KIAA0990 protein 


n c 
9.0 




1 


fi S 

O.O 


45 


19QTCA 


BE220806 


Hs.184697 Homo sapiens clone 23785 mRNA sequence 


7.1 


150 
lOU 


91 
£ i 


14 "i 


i zyoz i 


AB028945 


Hs.12696 cortactin SH3 domain-binding protein 


11.4 


114 


i 


IO 




IOUUOD 


BE061916 


Hs.125849 chromosome 8 open reading frame 2 


6.7 


R7 


1 


5 7 

O.I 




1 ^nn^7 


AF027153 


Hs.324787 solute carrier family 5 (inositol transp 


1 


\ 


1 


■| 




louuyo 


AK001635 


Hs. 14838 hypothetical protein FLJ 1 0773 


14.6 


01Q 


1^ 

IO 


7 fi 


50 




AB040914 Hs.278628 K1AA1481 protein 


13.2 


1^1 
Oo 1 


o<; 

£0 


104 


iouooo 


AW067800 Hs.155223 stanniocalcin 2 


72.2 


700 


1 


1 Q 
1 .a 






BE385099 


Hs.334727 hypothetical protein MGC3017 


6.5 


oo 


A 
■t 


O.O 




IOU*T*T 1 


U63630 


Hs.155637 protein kinase, DNA-activated, catalytic 


6.1 


61 


■] 


5.7 




130455 


D90041 


Hs.155956 N-acelyltransferase 1 (arylamine N-acety 


10.8 


706 


66 


9.2 


55 


130604 


AA383256 


Hs.1657 estrogen receptor 1 


32.2 


322 


1 


4.7 


130617 


M90516 


Hs.1674 glutamine-fructose-6-phosphate transamin 


10 


100 


1 


7.6 




130712 


AJ271881 


Hs.279762 bromodomain-containing 7 


17.5 


175 


2 


12.8 




131148 


AW953575 Hs.303125 p53-induced protein PIGPC1 


3.8 


585 


153 


3.7 




131388 


NM_014810Hs.92200 KIM0480 gene product 


7.6 


76 


1 


5 


60 


131564 


T93500 


Hs.28792 Homo sapiens cDNA FU1 1 041 fis, clone PL 


4.7 


381 


81 


6.4 


131742 


AA961420 


Hs.31433 ESTs 


11.7 


117 


1 


10.1 




131877 


J04088 


Hs.156346 topoisomerase (DNA) II alpha (170kD) 


6.8 


68 


1 


5.6 




131985 


AA503020 


Hs.36563 hypothetical protein FLJ22418 


40.2 


402 


1 


4 




132316 


U28831 


Hs.44566 KIAA1641 protein 


18.6 


186 


10 


1.5 


65 


132528 


T78736 


Hs.50758 SMC4 (structural maintenance of chromoso 


9.3 


93 


1 


8.4 


132742 


AA025480 Hs.292812 ESTs, Weakly similar to T33468 hypotheti 


6.5 


65 


1 


5.6 




132990 


X77343 


Hs.334334 transcription factor AP-2 alpha (activat 


12.7 


311 


25 


2.4 




133015 


AJ002744 


Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polyp 


4.6 


427 


93 


10.4 
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133199 


AF231981 


Hs.250175 homolog of yeast long chain polyunsatura 


3 


816 


275 


3.9 


133240 


AK001489 


Hs.242894 ADP-ribosylation factor-like 1 


8.1 


81 


1 


4.6 


133271 


Z48633 


Hs.283742 H.sapiens mRNA for retrotransposon 


12.4 


124 


6 


10.8 


133640 


AW246428 Hs.75355 ubiquitin-conjugating enzyme E2N (homolo 


8.5 


85 


1 


7.2 


133746 


AW410035 Hs.75862 MAD (mothers against decapentapfegic, Dr 


9.3 


93 


1 


7.8 


133999 


AA535244 


Hs.78305 RAB2, member RAS oncogene family 


7.8 


78 


1 


5.6 


134110 


U41060 


Hs.79136 LIV-1 protein, estrogen regulated 


4.5 


1472 330 


2.1 


134485 


X82153 


Hs.83942 cathepsin K (pycnodysostosis) 


34.3 


411 


12 


5.1 


134654 


AK001741 


Hs.8739 hypothetical protein FU 1 0879 


6.4 


64 


1 


5.1 


134880 AI879195 


Hs.90606 15 kDa selenoprotein 


5.7 


57 


1 


5 


135029 


H58818 


Hs.187579 hydroxysteroid (17-beta) dehydrogenase 7 


11.5 


115 


1 


10 


135389 


U05237 


Hs.99872 fetal Alzheimer antigen 


20.6 


206 


4 


19.1 


128305 AI954968 


Hs.279009 matrix Gla protein 


9.4 


94 


3 


5.3 
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TABLE 11 A 

Table 11 A shows the accession numbers for those pkeys lacking unigenelD's for Table 11. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

123619 371681J AA602964 AA609200 

113702 genbank_T97307 T97307 

114988 genbank_AA251089 AA251089 
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TABLE 12: Figure 12 from BRCA 001-3 PCT 



Table 12 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnfcjenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 

R2: Ratio of 90" 1 percentile tumor to body 

R3: Ratio of 75 m percentile body to tumor 

R4: Ratio of tumor to normal breast tissue 



Pkey 


ExAccn UnigenelD Unigene Title 


R1 


R2 


R3 


R4 


100131 


D12485 Hs.11951 phosphodiesterase I (PC-1 ) 


13.2 


244 


19 


9.9 


105500 


AW602166 Hs.222399 ESTs 


25.4 


508 


20 


3 


112244 


AB029000 Hs.70823 KIAA1077 protein 


5.7 


567 


100 


6.7 


114124 


W57554 Hs. 12501 9 ESTs 


24.2 


242 


10 


5.6 


119771 


AI905687 Hs.2533 ESTs 


3.5 


2073 


595 


2.1 


121723 


AA243499 Hs.104800 ESTs 


2.9 


214 


74 


3.7 


128790 


AF026692 Hs.105700 secreted frizzled-related protein 4 


17.4 


409 


24 


7.8 


131148 


AW953575 Hs.303125 ESTs 


3.8 


585 


153 


3.7 


131985 


AA503020 Hs.36563 ESTs 


40.2 


402 


1 


4 


133199 


AF231981 Hs.250175 Homo sapiens clone 23904 mRNA sequence 3 


816 


275 


3.9 
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TABLE 13: Table 1 from BRCA 001-5 US 
5 Table 13 depicts a preferred group of genes upregulated in breast cancer cells. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 



A5 



"f— '■ 

o 


Pkey 


ExAccn 


UniGene ID 


Unigene Title 


R1 




100038 


M97935 




control 


16.7 




100039 


M97935 




control 


6.3 


1,2b 


100040 


M97935 




control 


8.3 




100041 


M97935 




control 


14.8 




100082 


AB003103 


Hs.4295 


proteasome (prosome; macropain) 26S sub 


7.5 




100091 


AF000177 


Hs.1 11783 


Lsm1 protein 


4.9 




100100 


AF006084 


Hs.11538 


actin related protein 2/3 complex; subunit 


4.7 


li5 


100103 


AF007875 


Hs.5085 


dolichyl-ph osphate mannosyltransferase p 


13.4 




100114 


D00596 


Hs.82962 


thymidylate synthetase 


15.9 


s 


100121 


D10495 


Hs. 155342 


protein kinase C; delta 


4.6 




100123 


D10523 


Hs. 168669 


oxoglutarate dehydrogenase (lipoamide) 


7.5 


: - 


100128 


D11094 


Hs.61153 


proteasome (prosome; macropain) 26S sub 


4.4 


130 


100131 


D12485 


Hs.11951 


phosphodiesterase l/nucleotide pyrophosp 


8.7 


; w 


100137 


D13627 


Hs.15071 


chaperonin containing TCP1; subunit 8 (t 


9.5 




100144 


D13643 


Hs.75616 


Human mRNA for KIAA0018 gene; comp 


6 




100147 


D13666 


Hs.136348 


osteoblast specific factor 2 (fasciclin l-like 


8.5 




100154 


D14657 


Hs.81892 


KIAA0101 gene product 


10.5 


m 


100164 


D14812 


Hs.173714 


MORF-relatedgeneX 


4.6 




100169 


D14878 


Hs.82043 


D123 gene product 


7.9 




100190 


D21090 


Hs.178658 


RAD23 (S. cerevisiae) homobg B 


5.6 




100203 


D25538 


Hs.172199 


adenylate cyclase 7 


9.9 


40 


100209 


D26308 


Hs.76289 


biliverdin reductase B (flavin reductase (N 


4.9 


100215 


D26598 


Hs.82793 


proteasome (prosome; macropain) subunit 


14.2 




100216 


D26599 


Hs.1390 


proteasome (prosome; macropain) subunit 


11.3 




100219 


D28137 


Hs.118110 


bone marrow stromal cell antigen 2 


5.7 




100227 


D28915 


Hs.82316 


interferon-induced; hepatitis C-associated 


5.7 


45 


100248 


D31888 


Hs.78398 


KIAA0071 protein 


7.4 


100287 


D43950 


Hs.1600 


chaperonin containing TCP1 ; subunit 5 (e 


5.6 




100294 


D49396 


Hs.75454 


antioxidant protein 1 


12.9 




100307 


D50525 


Hs.699 


hypothetical protein 


8.4 




100335 


D63391 


Hs.6793 


platelet-activating factor acetylhydrolase; 


6.8 


50 


100340 


D63487 


Hs.82563 


KIAA0153 protein 


4.4 


100355 


D78129 


Hs.71465 


Homo sapiens mRNA forsqualene epoxid 


12.6 




100363 


D78514 


Hs.78563 


ubiquitin-conjugating enzyme E2G 1 (horn 


4.6 




100368 


D79987 


Hs.1 53479 


extra spindle poles; S. cerevisiae; homolo 


6.5 




100372 


D79997 


Hs.1 84339 


KIAA0175 gene product 


8.4 


55 


100375 


D80004 


Hs.75909 


KIAA0182 protein 


4.5 


100379 


D82060 


Hs.278721 


Ke4 gene; mouse; human homolog of 


8.1 




100387 


D83777 


Hs.75137 


KIAA0193gene product 


10.7 




100393 


D84145 


Hs.39913 


novel RGD-containing protein 


7.2 




100398 


D84557 


Hs.155462 


minichromosome maintenance deficient (m 


7.2 


60 


100405 


D86425 


Hs.82733 


nidogen 2 


5.4 


100406 


D86479 


Hs.1 18397 


AE-binding protein 1 


4.3 




100409 


D86957 


Hs.80712 


KIAA0202 protein 


11.9 




100421 


D86985 


Hs.79276 


Human mRNA for KIAA0232 gene; comp 


9.7 




100446 


D87464 


Hs.10037 


KIAA0274 gene product 


6.4 


65 


100447 


D87465 


Hs.74583 


KIAA0275 gene product 


10 


100448 


D87469 


Hs.57652 


EGF-like-domain; multiple 2 


6.2 
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ATPase; H+ transporting; lysosomal (vacu 

fucosyltransferase 8 (alpha (1 ;6) fucosyltr 

Ras-Like Protein Tc4 

Carboxyl Methyltransferase, Aspartate, A 

Collagen, Type Viii, Alpha 1 

Ribosomal Protein L39 Homolog 

Epican, Alt. Splice 11 

Epican, Alt. Splice 12 

Fibronectin, Alt Splice 1 

Mucin 1, Epithelial, Alt. Splice 9 

Basic Transcription Factor, 44 Kda Subun 

Cytosolic Acetoacetyl-Coenzyme A Thio 

Rad2 

Protein Kinase Pitslre, Alpha, Alt. Splice 
Cathepsin B 
Transcription Factor liia 
Guanosine 5' -Monophosphate Synthase 
Ras Inhibitor Inf 

Endothelial Cell Growth Factor 1 
Oncogene E6-Ap, Papillomavirus 
lymphocyte cytosolic protein 1 (L-plastin 
ubiquitin-like 4 

interferon; gamma-inducible protein 30 
diaphorase (NADH/NADPH) (cytochrom 
acid phosphatase 5; tartrate resistant 
biglycan 

matrix metalloproteinase 9 (gelatinase B; 
replication protein A2 (32kD) 
Human MHC class II HLA-DQ-beta mRN 
glucose phosphate isomerase 
voltage-dependent anion channel 1 
chemokine (C-X-C motif); receptor 4 (fus 
Human neuropeptideY receptor Y1 (NPY 
heat shock 70kD protein 4 
Homo sapiens liver expressed protein gen 
H2A histone family; member 0 
cyclin-dependent kinase inhibitor 3 (CDK 
sorbitol dehydrogenase 
glycogen synthase kinase 3 beta 
proteasome (prosome; macropain) 26S sub 
inner membrane protein; mitochondrial (m 
Homo sapiens (cell line HL-6) alpha topo 
phosphomevalonate kinase 
COX17 (yeast) homolog; cytochrome c ox 
interferon-stimulated protein; 15 kDa 
proliferating cell nuclear antigen 
heterogeneous nuclear ribonucleoprotein C 
apolipoprotein C-l 

NADH dehydrogenase (ubiquinone) flavo 
Human somatic cytochrome c (HCS) gene 
protective protein for beta-galactosidase ( 
RAS p21 protein activator (GTPase activa 
interferon-induced protein 56 
myxovirus (influenza) resistance 2; homol 
X-ray repair complementing defective rep 
Human propionyl-CoA carboxylase beta-s 
hypoxanthine phosphoribosyltransferase 1 
DNA segment on chromosome X (unique) 
H2A histone family; member Z 
H2B histone family; member A 
H2A histone family; member A 
nucleolin 

cerebellar degeneration-related protein (62 
protein phosphatase 2 (formerly 2A); regu 
TIA1 cytotoxic granule-associated RNA-b 
pyrroline-5-carboxylate reductase 1 
carboxypeptidase B1 (tissue) 
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101770 


M81601 


Hs.78869 


transcription elongation factor A (Sll); 1 


4.6 




101791 


M83822 


Hs.62354 


cell division cycle 4-like 


9.7 




101803 


M86546 


Hs. 155691 


pre-B-cell leukemia transcription factor 1 


5.5 




101809 


M86849 


Hs.323733 


Homo sapiens connexin 26 (GJB2) mRNA 


22.5 
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101839 


M93036 


Hs.692 


membrane component; chromosomal 4; su 


4 
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M94250 


Hs.82045 


midkine (neurite growth-promoting factor 


7.6 
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M99701 


Hs.95243 


transcription elongation factor A (Sll)-like 


11.4 




101973 


S82597 


Hs.80120 


UDP-N-acetyl-alpha-D-galactosamine:po 


4.6 


10 
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U00968 


Hs.166 


Human SREBP-1 mRNA; complete cds 


4.1 
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Hs.82643 


protein tyrosine kinase 9 


4.4 




102025 


U0391 1 


Hs.78934 


mutS (E. coli) homolog 2 {colon cancer; n 


4 
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Hs.83734 


syntaxin 4A (placental) 


6.1 
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heat shock 10kD protein 1 (chaperonin 10 


4.4 


15 
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interleukin enhancer binding factor 2; 45k 


10.4 
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Hs.75760 


sterol carrier protein 2 


9.5 
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Hs.1575 


small nuclear ribonucleoprotein D3 polyp 


6.6 
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U15173 


Hs.155596 


BCL2/adenovirus E1B 19kD-interacting p 


4.3 
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U16954 


Hs.75823 


ALL1 -fused gene from chromosome 1q 


6.9 


H' 


102179 


U19713 


Hs.76364 


allograft inflammatory factor 1 


4.8 
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Hs.83551 


microfibrils ^associated protein 2 


7.2 
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U20758 


Hs.313 


secreted phosphoprotein 1 (osteopontin; b 


7.2 
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U21090 


Hs.74598 


polymerase (DNA directed); delta 2; regu 


4.3 


If" 1 
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Hs.574 


fructose-b is phosphatase 1 


4.5 
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interferon; alpha-inducibie protein (clone 


9.9 


35 
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Hs.78776 


putative transmembrane protein 


4.9 
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lysyl oxidase-like 1 


8.5 
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proteasome (prosome; macropain) 26S sub 


5.4 


~! 
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chromobox homolog 3 (Drosophila HP1 g 


7.7 
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U28014 


Hs.74122 


caspase 4; apoptosis-related cysteine prot 


5.4 


= 30 


102260 


U28386 


Hs.159557 


karyopherin alpha 2 (RAG cohort 1 ; impo 


6.3 




102261 


U28488 


Hs. 155935 


complement component 3a receptor 1 


5.7 


Li 


102273 


U30888 


Hs.75981 


ubiquitin specific protease 14 (tRNA-guan 


6.1 




102298 


U32849 


Hs.54483 


N-myc (and STAT) interactor 


4.1 


TU 
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U33052 


Hs.69171 


protein kinase C-like 2 


4.3 
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Hs.90073 


chromosome segregation 1 (yeast homolo 


5.4 
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Hs.82327 


glutathione synthetase 


4.1 
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U35451 


Hs.77254 


chromobox homolog 1 (Drosophila HP1 b 
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s"s : 
5 ^ ; 


102348 


U37519 


Hs 87539 


alriphvrip dphvriroapnasp fl 


9.4 


40 


102361 


U39400 


Hs.75859 


chromosome 11 open reading frame 4 


5.2 


102362 


U39412 


Hs.75932 


N-ethylmaleimide-sensitive factor attachm 


9.3 




102369 


U39840 


Hs.299867 


hepatocyte nuclear factor 3; alpha 


7.7 




102395 


U41767 


Hs 92208 

1 low c-i^\jyj 


a disintporin and rnptallonrntpinasp riomai 


10.4 




102409 


U43286 


Hs.1 18725 


selenophosphate synthetase 2 


6.2 


45 


102418 


U43923 


Hs.79058 


suppressor of Ty (S.cerevisiae) 4 homolog 


4.1 


102425 


U44772 


Hs 3873 


nalmitovl-nrofpin thinp^fpra^p I'rpmiri-linn 


4.8 




102457 


U48807 


Hs 2359 


dual ^npfifiritv nhnsnhata<?p 4 


6.3 
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Hs 81 548 
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8.6 




1 157877 






4 3 
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Hs 1QRRQQ 
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fi 3 
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H<! 2S4498 
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5 




102562 


1159309 

U J7UVC7 


Hs 75853 


fnmarafp hvrirataQP 

lUlllalOlC 1 IjrUI CJU30C 


5 


55 


102568 


U59877 


Hs.223025 


RAB31; member RAS oncogene family 


9.1 


102580 


U60808 


Hs.152981 


CDP-diacylglycerol synthase (phosphatid 


7.9 




102581 


U61145 


Hs.77256 


enhancer of zeste (Drosophila) homolog 2 


7.6 




102590 


U62136 


Hs.79300 


Homo sapiens enterocyte differentiation a 


7 




102591 


U62325 


Hs.324125 


amyloid beta (A4) precursor protein-bindi 


4 


60 


102592 


U62389 


Hs.11223 


Human putative cytosolic NADP-depende 


5 


102617 


U65928 


Hs.198767 


Jun activation domain binding protein 


6.1 




102618 


U65932 


Hs.81071 


extracellular matrix protein 1 


23.2 




102638 


U67319 


Hs.9216 


caspase 7; apoptosis-related cysteine prot 


8.9 




102663 


U70322 


Hs.168075 


karyopherin (importin) beta 2 


7.1 


65 


102666 


U70660 


Hs.279910 


ATX1 (antioxidant protein 1; yeast) homo 


4.7 


102679 


U72661 


Hs.11342 


ninjurin 1; nerve injury-induced protein-1 


4.7 




102687 


U73379 


Hs.93002 


ubiquitin carrier protein E2-C 


7.7 




102704 


U76638 


Hs.54089 


BRCA1 associated RING domain 1 


5.6 
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Hs.2934 
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X70040 
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V7A04 Q 
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Hs.2903 
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small inducible cytokine subfamily A (Cy 
Human clone 23759 mRNA; partial cds 
asparaginyl-tRNA synthetase 
Human clone 23801 mRNA sequence 
Human clone 23948 mRNA sequence 
tumor susceptibility gene 101 
nucleolar protein p40 
E4F transcription factor 1 
pyridoxal (pyridoxine; vitamin B6) kinase 
nuclear RNA helicase; DECD variant of D 
Human clone 23773 mRNA sequence 
carboxypeptidase D 

chaperonin containing TCP1 ; subunit 2 (b 
Human uncoupling protein homolog (UCP 
Human D9 splice variant B mRNA; comp 
Homo sapiens Nedd-4-like ubiquitin-prot 
plasminogen activator; urokinase 
heme oxygenase (decycling) 1 
aldolase A; fructose-bisphosphate 
cytochrome c oxidase subunit Vic 
hematopoietic cell-specific Lyn substrate 
non-metastatic cells 1; protein (NM23A) 
G1 to S phase transition 1 
trefoil factor 1 (breast cancer; estrogen-ind 
interieukin 1 receptor antagonist 
multifunctional polypeptide similar to SA 
matrix metalloproteinase 1 (interstitial col 
matrix metalloproteinase 11 (stromelysin 
proteasome (prosome; macropain) subunit 
ribonucleotide reductase Ml polypeptide 
cyclin D1 (PRAD1: parathyroid adenomat 
interieukin enhancer binding factor 1 
proteasome (prosome; macropain) subunit 
translocating chain-associating membrane 
PCTAIRE protein kinase 1 
isocitrate dehydrogenase 2 (NADP+); mit 
intercellular adhesion molecule 3 
macrophage stimulating 1 receptor (c-met 
protein phosphatase 4 (formerly X); cataly 
coatomer protein complex; subunit beta 2 
DEAD/H (Asp-Glu-Ala-Asp/His) box pol 
eukaryotic translation elongation factor 1 
monokine induced by gamma interferon 
Human endogenous retrovirus mRNA for 
retinoblastoma-binding protein 7 
retinoblastoma-binding protein 4 
v-rel avian reticuloendotheliosis viral onco 
testis enhanced gene transcript 
hexabrachion (tenascin C; cytotactin) 
lung resistance-related protein 
immature colon carcinoma transcript 1 
coatomer protein complex; subunit beta 
SMA5 

small nuclear ribonucleoprotein polypepti 
serine/threonine kinase 9 
uracil-DNA glycosylase 
SULT1C sulfotransferase 
chloride channel; nucleotide-sensitive; 1A 
synaptobrevin-like 1 
methionine-tRNA synthetase 
cofilin 1 (non-muscle) 
DR1 -associated protein 1 (negative cofact 
Sec23 (S. cerevisiae) homolog B 
adaptor-related protein complex 2; sigma 
Ksapiens mRNA for Ptg-12 protein 
translocase of inner mitochondrial membr 
16 
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103464 Y00285 Hs.76473 insulin-like growth factor 2 receptor 4.2 

103470 Y00796 Hs.174103 integrin; alpha L (antigen CD1 1A (p180); 4.5 

103494 Y08991 Hs.83050 phosphatidylinositol 3-kinase-associated p 4.1 

103505 Y09912 Hs.33102 transcription factor AP-2 beta (activating 4.5 

5 103547 Z14982 Hs.180062 proteasome (prosome; macropain) subunit 4.3 

103551 Z15115 Hs.75248 topoisomerase (DNA) II beta (180kD) 4 

103565 Z22548 Hs.146354 thioredoxin-dependent peroxide reductase 7.6 

103587 Z29083 Hs.82128 5T4 oncofetal trophoblast glycoprotein 14.6 

103621 Z47727 Hs.150675 polymerase (RNA) II (DNA directed) pol 6.3 

10 103622 Z48042 Hs.278672 membrane component; chromosome 11; s 4.4 

103658 Z74615 Hs.172928 collagen; type I; alpha 1 5.9 

103680 Z93784 Homo sapiens DNA sequence from PAC 4.4 

103772 AA092473 Hs.278554 chromobox homolog 3 (Drosophila HP1 g 4.9 

103774 M092898 Hs.92918 ESTs; Weakly similar to R07G3.8 [C.eleg 6.1 

15 103821 AA157623 Hs.198793 KIAA0750 gene product 23.3 

103835 AA172215 Hs.93748 ESTs; Moderately similar to TRANSCRIP 4 

103886 AA236384 Hs.1 05737 ESTs; Weakly similar to gene 9306 protei 4.9 

£T 103890 AA236843 Hs.72085 ESTs; Weakly similar to unknown [S.cere 7.8 

D 103892 AA243523 Hs.239189 ESTs 4.8 

SO 104054 AA393432 Hs.7100 hypothetical protein 5.3 

i'n 104115 AA428090 Hs.26102 ESTs 28.7 

j£ 1 104136 AA442669 Hs.268371 zv68f6.r1 Soares_total_fetus_Nb2HF8_9w 5.7 

W 104147 AA451992 Hs.283037 ESTs; Highly similar to HSPC039 protein 6.9 

Hi 104173 AA476564 Hs.76561 ESTs; Weakly similar to finger protein HZ 5.2 

25 104181 AA479521 Hs.283740 ESTs 7.8 

104183 AA480838 Hs.1 14309 ESTs 5.1 

Ci 104192 AA486946 Hs.21321 Homo sapiens mRNA; cDNA DKFZp564 4.3 

104209 AB000221 Hs.1 6530 small inducible cytokine subfamily A (Cy 12.3 

~. 104234 AB002357 Hs.168212 kinesin family member 3B 6.2 

SO 104271 C01687 Hs.7381 ATP synthase; H+ transporting; mitochon 4.2 

M: 104278 C02582 Hs.109253 ESTs; Highly similar to N-terminai acetyl 4.5 

n\ 104307 D52818 Hs.1 11680 endosulfine alpha 4.7 

M: 104309 D55869 Hs.284123 Homo sapiens mRNA full length insert cD 4.2 

#" 104370 H19378 Hs.21851 Homo sapiens mRNA; cDNA DKFZp586 6.4 

i35 104446 L44497 Hs.7351 ESTs 4.9 

s"i 104453 M19169 Hs.123114 cystatinSN 11.6 

= 104476 N33807 Hs.324275 protease; serine; 15 5.6 

104558 R56678 Hs.88959 Human DNA sequence from clone 967N2 6.3 

104592 R81003 Hs.325820 serine protease; umbilical endothelium 13.6 

40 104634 AA004274 Hs.19151 ESTs 6.3 

104636 AA004415 Hs.106106 ESTs 10.1 

104658 AA007145 Hs.27268 Homo sapiens mRNA; cDNA DKFZp564 4.3 

104667 AA007234 Hs.30098 ESTs 16.6 

104675 AA009596 Hs.301553 ESTs; Moderately similar to !!!! ALU SU 4.6 

45 104767 AA025534 Hs.8852 ESTs 4.8 

104785 AA027163 Hs.7942 ESTs 8.1 

104791 AA029046 Hs.301871 ESTs; Moderately similar to cAMP indue 10.9 

104804 AA031357 Hs.31803 ESTs; Weakly similar to N-WASP[H.sap 5.5 

104807 AA032147 Hs.23296 ESTs 10.4 

50 104837 AA039469 Hs.21126 ESTs; Weakly similar to KIAA0299[H.s 4.6 

104849 AA040270 Hs.241507 Homo sapiens mRNA; cDNA DKFZp564 4.3 

104867 AA045481 Hs.225979 Human gene from PACs37M17 and 305B 4.5 

104884 AA053021 Hs.14511 SCO (cytochrome oxidase deficient yeast 4.7 

104906 AA055809 Hs.26802 ESTs; Weakly similar to phosphoprotein [ 8.8 

55 104919 AA057193 Hs.25252 ESTs 5.5 

104921 AA057839 Hs.1508 ESTs 4.2 

104926 AA058846 Hs.33363 DKFZP434N093 protein 7 

104938 AA064627 Hs.318725 ESTs; Highly similar to CGI-72 protein [H 7.1 

104943 AA065217 Hs.114218 ESTs 5.7 

60 104957 AA074919 Hs.10026 ESTs; Weakly similar to ORFYJL063c[S 4.7 

104961 AA076672 Hs.33905 ESTs 5.5 

104968 AA084602 Hs.29669 ESTs 4.3 

104975 AA086071 Hs.50758 chromosome-associated polypeptide C 8.3 

104977 AA088228 Hs.18272 ESTs 6.2 

65 104978 AA088458 Hs.19322 ESTs 6.7 

104987 AA101723 Hs.1 1861 ESTs 9.2 

105002 AA113266 Hs.182704 ESTs; Moderately similar to alternatively 6.9 
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105012 AA1 16036 Hs.9329 

105019 M121879 Hs.9280 

105029 M126855 Hs.13268 

105033 M1 27964 Hs.274329 

105035 AA1 28486 Hs.8859 

105039 AA1 30349 Hs.36475 

105062 M134968 Hs.36529 

105076 AA142858 Hs.37810 

105087 M147884 Hs.9812 

105091 AA148859 Hs.179909 

105093 M149051 Hs.32405 

105107 M1 52302 Hs.25035 

105127 M158132 Hs.301957 

105132 M1 59501 Hs.247280 

105143 AA1 65333 Hs.24808 

105154 AA171736 Hs.35947 

105162 AA176690 Hs.4084 

105186 AA191512 Hs.28005 

105209 AA205072 Hs.227743 

105223 AA211388 Hs.7750 

105252 M227428 Hs.9728 

105253 AA227448 Hs.5003 
105261 AA227871 Hs.6361 
105263 AA227926 Hs.6682 
105274 AA228122 Hs.281866 
105297 AA233451 Hs.183858 
105309 M233790 Hs.4104 
105312 AA233854 Hs.23348 
105342 AA235286 Hs.157078 
105376 AA236559 Hs.8768 
105386 AA236950 Hs.8115 
105397 AA242868 Hs.7395 

105399 AA243007 Hs.16420 

105400 AA243052 Hs.65648 
105404 AA243303 Hs.21187 
105409 AA243562 Hs.301855 
105436 AA252172 Hs.237856 
105483 AA255874 Hs.23458 
105493 AA256268 Hs.10283 

105495 AA256317 Hs.28785 

105496 AA256323 Hs.301997 
105500 AA256485 Hs.222399 
105507 AA256678 Hs.226318 
105538 AA258860 Hs.32597 
105544 AA261954 Hs.24678 
105546 AA262032 Hs.268281 
105549 AA262417 Hs.5415 
105551 AA262477 Hs.25292 
105560 M262783 Hs.306915 

105565 AA278302 Hs.18349 

105566 AA278323 Hs.17481 
105575 AA278717 Hs.12772 
105584 AA279012 Hs.3454 
105596 M279418 Hs.18490 
105604 AA279787 Hs.15467 
105610 AA279991 Hs.99872 
105621 AA280865 Hs.6375 
105627 AA281245 Hs.23317 
105638 AA281599 Hs.247817 
105645 AA282138 Hs.11325 
105650 M282347 Hs.25635 
105666 AA283930 Hs.34906 
105674 AA284755 Hs.279789 
105687 AA286809 Hs.28423 
105700 AA287643 Hs.35254 
105705 AA290767 Hs.101282 
105709 M291268 Hs.26761 



chromosome 20 open reading frame 1 
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ESTs 
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ESTs 

ESTs 

ESTs 

ESTs 
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ESTs 

DKFZP566G223 protein 

ESTs; Weakly similar to contains similari 

HBV associated factor 

ESTs 

methyl-CpG binding domain protein 4 
KIAA1 025 protein 

Homo sapiens mRNA; cDNA DKFZp564 
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ESTs 
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ESTs 
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transcriptional intermediary factor 1 
ESTs; Weakly similar to cDNA EST yk38 
S-phase kinase-associated protein 2 (p45) 
ESTs 

ESTs; Weakly similar to !!!! ALU SUBFA 
ESTs 

ESTs; Weakly similar to house-keeping p 

ESTs; Highly similar to SH3 domain-bind 

RNA binding motif protein 8 

ESTs 

ESTs 

ESTs; Moderately similar to cAMP indue 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp586 
DKFZP434N1 26 protein 
CGI-96 protein 

ESTs; Moderately similar to CCR4-associ 
ring finger protein (C3H2C3 type) 6 
ESTs 

ESTs; Weakly similar to 62D9.a [D.melan 
ESTs 

ribonuclease HI; large subunit 
ESTs 

ESTs; Weakly similar to partial CDS [C.e 
Homo sapiens clone 24606 mRNA sequen 
ESTs 

ESTs; Weakly similar to KIAA0665 prate 
ESTs 

ESTs; Moderately similar to putative pho 
ESTs; Weakly similar to trithorax homolo 
Homo sapiens mRNA; cDNA DKFZp564 
ESTs 

Homo sapiens mRNA for for histone H2B 
ESTs 

ESTs; Highly similar to HSPC003 [H.sap 
ESTs 

CDW52 antigen (CAMPATH-1 antigen) 
ESTs 

ESTs; Weakly similar to hypothetical pro 
Homo sapiens mRNA; cDNA DKFZp434 
DKFZP586L0724 protein 



Attorney Docket No.:018501-005210US 



105731 AA292711 Hs.29131 ESTs 6.4 

105753 AA299789 Hs.110857 ESTs 7 

105774 M348014 Hs.23412 ESTs 7.1 

105784 AA350771 Hs.17850 ESTs 13.4 

5 105791 AA358038 Hs.14368 SH3-binding domain glutamic acid-rich p 4.3 

105807 M393803 Hs.16869 ESTs; Moderately similar to COLLAGEN 5.3 

105808 M393808 Hs.286131 KIAA0438 gene product 4.1 

105812 AA394126 Hs.20814 ESTs; Highly similar to CGl-27 protein [H 14.6 

105813 AA394140 Hs.18585 ESTs 4.9 
10 105819 AA397920 Hs.28783 Homo sapiens mRNA; cDNA DKFZp564 4.9 

105870 AA399623 Hs.101067 ESTs 4.8 

105874 AA400074 Hs.171118 ESTs 4 

105896 AA400999 Hs.7838 Human ring zinc-finger protein (ZNF1 27- 4.8 

105934 AA404248 Hs.16577 ESTs 5.2 

15 105935 AA404277 Hs.263727 ESTs; Weakly similar to bisphosphate 3'- 4 

i_s.. 105966 AA406105 Hs.5344 adaptor-related protein complex 1; gamma 8.3 

105974 AA406321 Hs.6224 KIAA0895 protein 4.6 

C 105990 AA410336 Hs.29403 ESTs; Weakly similar to PROBABLE AT 4.5 

Q 105995 AA410510 Hs.5345 ESTs 4.9 

30 106000 AA410972 Hs.20726 ESTs 5.8 

? :v 106007 AA411462 Hs.11042 ESTs; Weakly similar to veli 1 [H.sapiens 6.9 

W 106016 AA411819 Hs.8164 KIAA0898 protein 5 

HI 106034 AA412473 Hs.14928 ESTs 6.6 

'C- 106042 AA412700 Hs.169895 ubiquitin-conjugating enzyme E2L 6 4.6 

25 106057 AA417067 Hs.289074 ESTs 4.5 

106065 AA417558 Hs.25206 ESTs 12.3 

106070 AA417761 Hs.5957 Homo sapiens clone 24416 mRNA sequen 5 

«. 106103 AA421104 Hs.12094 ESTs 15.4 

W 106126 AA424006 Hs.22972 ESTs; Moderately similar to H5AR [M.m 6.4 

1=30 106154 AA425304 Hs.6994 ESTs 5.1 

f*.\ 106157 AA425367 Hs.34892 ESTs 11.1 

*z. 106166 AA425872 Hs.19561 NADH dehydrogenase (ubiquinone) 1 alp 19.3 

106204 AA428024 Hs.21479 ESTs 4.7 

Ci 106210 AA428239 Hs.10338 ESTs 5.7 

=35 106220 AA428582 Hs.32196 ESTs; Moderately similar to metargidinp 7.7 

— : 106236 AA429951 Hs.21104 ESTs 8 

106240 AA430074 Hs.18552 ESTs; Weakly similar to Ylr218cp [S.cere 4.4 

106263 AA431462 Hs.28329 ESTs 4.9 

106288 AA435536 Hs.24336 ESTs 8.8 

40 106293 AA435591 Hs.301444 signal sequence receptor; gamma (transloc 8.7 

106310 AA436244 Hs.17240 ESTs 4.5 

106317 AA436568 Hs.108124 ESTs 4 

106328 AA436705 Hs.28020 KIAA0766 gene product 4.4 

106341 AA441798 Hs.5243 ESTs; Moderately similar to plL2 hypoth 23.7 

45 106348 AA442253 Hs.10702 ESTs 4.7 

106350 AA442763 Hs.194698 cyclin B2 6.1 

106371 AA443923 Hs.170310 ESTs 6.8 

106389 AA446949 Hs.6236 ESTs 4.7 

106394 AA447223 Hs.25320 Homo sapiens clone 25142 mRNA sequen 4.4 

50 106426 AA448282 Hs.16206 ESTs; Weakly similar to F55C1 2.5 [C.ele 4.5 

106459 AA449741 Hs.4029 glioma-amplified sequence-41 4.8 

106462 AA449912 Hs.30532 ESTs; Highly similar to CGI-77 protein (H 5.2 

106468 AA450047 Hs.14770 ESTs 6.8 

106479 AA450351 Hs.75251 ESTs 12.4 

55 106494 AA452108 Hs.18387 transcription factor AP-2 alpha (activating 4.5 

106503 AA452411 Hs.29679 ESTs; Highly similar to mediator [H.sapie 5.1 

106507 AA452584 Hs.267819 protein phosphatase 1; regulatory (inhibito 4.9 

106533 AA453786 Hs.145998 ESTs 8.3 

106568 AA455970 Hs.28285 patched related protein translocated in ren 7.6 

60 106586 AA456598 Hs.57787 ESTs 8.2 

106589 AA456646 Hs.28661 ESTs 4.8 

106606 AA457730 Hs.283437 Homo sapiens clone 23851 mRNA sequen 4.4 

106611 AA458904 Hs.26267 ESTs; Weakly similar to torsinA [H.sapie 7 

106614 AA458934 Hs.256150 ESTs 4.5 

65 106628 AA459657 Hs.12311 Homo sapiens clone 23570 mRNA sequen 6.5 

106637 AA459961 Hs.250824 ESTs 5.5 

106644 AA460239 Hs.12680 ESTs 4.4 
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106664 AA460969 Hs.7510 mitogen-activated protein kinase kinase ki 8.4 

106698 AA463745 Hs.29403 ESTs; Weakly similar to PROBABLE AT 5.3 

106719 AA465171 Hs.236844 ESTs 5.6 

106726 AA465339 Hs.3886 ESTs 10.1 

5 106747 AA476473 Hs.171957 triple functional domain (PTPRF interact! 10.4 

106759 AA477263 Hs.25584 ESTs 4.2 

106765 AA477717 Hs.306117 interleukin 13 receptor; alpha 1 6.9 

106784 AA478558 Hs.227913 API5-like 1 5.1 

106831 AA482014 Hs.29463 centrin; EF-hand protein; 3 (CDC31 yeast 5.1 

10 106836 AA482112 Hs.238707 ESTs 4.8 

106840 AA482548 Hs.5534 ESTs 10.3 

106856 AA486183 Hs.285123 ESTs; Weakly similar to similar to oxyste 6.2 

106865 AA487228 Hs.19479 ESTs 4.5 

106878 AA488872 Hs.12314 Homo sapiens mRNA; cDNA DKFZp586 7.9 

15 106888 AA489101 Hs.24734 oxysterol binding protein 6.4 

L_| 106895 AA489665 Hs.25245 ESTs 4.6 

106909 AA490323 Hs.250747 SUMO-1 activating enzyme subunit 1 4.2 

Ui 106919 AA490885 Hs.21766 ESTs 12.3 

Pi 106920 AA490899 Hs.296323 ESTs 6.2 

7Z0 106941 AA496204 Hs.237971 ESTs 4 

— 106942 AA496347 Hs.31314 retinoblastoma-binding protein 7 4.8 

W- 106948 AA496788 Hs.21077 KIAA0532 protein 4 

pi 106968 M504631 Hs.26813 ESTs; Weakly similar to hypothetical 43.2 4.4 

l r '\ 106973 AA505141 Hs.11923 Human DNA sequence from clone 167A1 5.4 

"25 106980 AA521 121 Hs.8858 bromodomain adjacent to zinc finger dom 4.1 

p 106981 AA521157 Hs.74101 ESTs 5.7 

106998 AA598461 Hs.195464 insulin-like growth factor binding protein 18.7 

107008 AA598710 Hs.23740 ESTs 6.2 

W 107028 AA599214 Hs.24143 ESTs 4.1 

PO 107032 AA599472 Hs.247309 succinate-CoA ligase; GDP-forming; beta 5.3 

pi 107052 M600134 Hs.12482 glyceronephosphateO-acyltransferase 4.8 

107053 AA600147 Hs.5741 ESTs; Weakly similar to NADH-cytochro 5.8 

107056 M600310 Hs.18720 programmed cell death 8 (apoptosis-induc 4.9 

F=i 107080 AA609210 Hs.19221 ESTs 8.4 

35 107102 AA609723 Hs.30652 ESTs 8 

W 107109 AA609943 Hs.32793 ESTs 9.5 

107129 AA620553 Hs.4756 flap structure-specific endonuclease 1 4.9 

107132 M620598 Hs.9052 ESTs 5.3 

107136 AA620795 Hs.8207 ESTs 4 

40 107140 M620889 Hs.170088 ESTs 6.7 

107151 M621169 Hs.8687 ESTs 19 

107159 AA621340 Hs.10600 ESTs; Weakly similar to ORF YKR081c [ 8.1 

107174 AA621714 Hs.25338 ESTs 8.5 

107217 D51095 Hs.35861 DKFZP586E1621 protein 7.2 

45 107252 D59971 Hs.25925 ESTs 7.9 

107295 T34527 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:po 5.6 

107299 T40327 Hs.30661 lung resistance-related protein 8.4 

107324 T81665 Hs.278422 DKFZP586G1 122 protein 7.5 

107372 U85625 Hs.8297 ribonuclease 6 precursor 4.7 

50 107373 U85773 Hs.154695 phosphomannomutase 2 4.8 

107481 W58247 Hs.279766 Homo sapiens kinesin superfamily motor 6.3 

107531 Y13936 Hs.17883 protein phosphatase 1G (formerly 2C); ma 8.3 

107859 AA024835 Hs.47584 potassium voltage-gated channel; delayed 7.3 

107890 AA026030 Hs.61311 ESTs; Weakly similar to CALPAIN 2; LA 7.3 

55 107908 AA026894 Hs.42826 ESTs 4.9 

108039 AA041341 Hs.46670 ESTs 5.4 

108040 AA041551 Hs.159971 ESTs 8.4 
108102 AA046424 Hs.49433 ESTs; Weakly similar to HYPOTHETICA 6.6 
108217 AA058686 Hs.62588 ESTs 7.7 

60 108255 AA063157 Hs.172608 ESTs 4 

108358 AA071514 Hs.1634 ESTs 4 

108609 AA100694 Hs.69499 Human DNA sequence from BAC 15E1 o 5.5 

108647 AA112396 Hs.44276 ESTs; Moderately similar to HOMEOBO 14.3 

108676 AA1 15562 Hs.274417 Homo sapiens mRNA; cDNA DKFZp564 5.2 

65 108687 AA120785 Hs.54347 ESTs 5.6 

108695 AA121315 Hs.70823 KIAA1 077 protein 10.5 

108733 AA1 26422 zn84f1.s1 Stratagene lung carcinoma 9372 4.4 
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108774 AA128125 Hs.71040 ESTs; Moderately similar to CELL GROW 4.6 

108828 M131584 Hs.273344 DKFZP564O0463 protein 5.5 

108872 AA134063 Hs.111680 ESTs 7.2 

108884 M1 34958 Hs.293591 ESTs 11.3 

5 108893 AA135894 Hs.194691 retinoic acid induced 3 8.9 

109008 AA156360 Hs.87128 ESTs 14.7 

109010 AA156460 Hs.44229 dual specificity phosphatase 12 4.9 

109011 AA156542 Hs.72127 ESTs 4.6 
109042 AA159525 Hs.71779 Homo sapiens DNA from chromosome 19 7.2 

10 109086 AA166695 Hs.270737 tumor necrosis factor (ligand) superfamtly 4 

109090 AA167006 Hs.70499 ESTs 5.9 

109101 M167708 Hs.52184 ESTs 4.2 

109112 AA169379 Hs.257924 ESTs 4 

109160 M179387 Hs.301997 DKFZP434N1 26 protein 4 

15 109166 AA179845 Hs.73625 RAB6 interacting; kinesin-like (rabkinesin 13.6 

109178 AA181600 Hs.283707 ESTs 11.8 

109179 AA181902 Hs.192789 ESTs; Weakly similarto HI! ALU SUBFA 5.4 
="~ : 109261 AA1 95255 Hs.61779 ESTs 6.7 
Ci 109270 AA195515 Hs.3585 ESTs; Weakly similar to alternatively spli 4.9 
fiO 109277 AA196332 Hs.86043 ESTs 5.4 
tf- 109313 AA206800 Hs.86276 ESTs; Moderately similar to zinc finger p 5.5 
Ul 109415 AA227219 Hs.1 10826 trinucleotide repeat containing 9 20.1 
Ci 109454 AA232255 Hs.295232 ESTs 4.7 
pi 109467 AA232904 Hs.63187 ESTs 6.8 
;25 109481 AA233342 Hs.289069 ESTs; Weakly similar to WD40 protein C 10.6 

109508 AA233892 Hs.55902 ESTs; Weakly similarto!!!! ALU SUBFA 8 

O 109514 AA234087 Hs.262346 ESTs; Weakly similar to ORF2: function 8.2 

109572 F02027 Hs.171937 ESTs 4.8 

L. 109632 F04165 Hs.235873 ESTs; Weakly similar to K11C4.2 [C.eleg 5.2 

£30 109644 F04477 Hs.291531 ESTs; Moderately similarto GLYCERAL 6.6 

M, 109703 F09684 Hs.24792 ESTs; Weakly similar to ORFYOR283w 7.1 

== : 109726 F10009 Hs.9196 ESTs 5 

109747 F10161 Hs.22969 ESTs 4.7 

S'> 109799 F10770 Hs.180378 Homo sapiens clone 669 unknown mRNA 4.5 

fS5 109814 F10979 Hs.153106 Homo sapiens clone 23728 mRNA sequen 8.7 

J" 110189 H20543 Hs.6278 DKFZP586B1621 protein 16.6 

1^ 110240 H25577 Hs.176588 ESTs; Weakly similar to CYTOCHROME 6.2 

110280 H29285 Hs.32468 ESTs 4.5 

110520 H56965 Hs.4082 yr09f06.s1 Soares fetal liver spleen 1 NFL 5.7 

40 110561 H59617 Hs.5199 ESTs; Weakly similar to UBIQUITIN-CO 19.5 

110707 H95079 Hs.15617 ESTs; Weakly similar to !!!! ALU SUBFA 6.2 

110734 H98714 Hs.24131 ESTs 30.2 

110770 N22262 Hs.131705 ESTs 5.8 

110780 N23174 Hs.22891 solute carrier family 7 (cationic amino aci 8.2 

45 110787 N24716 Hs.12244 ESTs; Weakly similarto C44B9.1 [C.eleg 6.7 

110794 N25262 Hs.27931 ESTs 5.9 

110799 N26101 Hs.323401 Human ring zinc-finger protein (ZNF1 27- 4 

110818 N29454 Hs.27552 ESTs; Weakly similar to putative p150 [H 4.3 

110839 N30856 Hs.30246 solute carrier family 19 (thiamine transpo 12.8 

50 110844 N31952 Hs.167531 Homo sapiens mRNA full length insert cD 10.1 

110854 N32919 Hs.27931 ESTs 4.7 

110856 N33063 ESTs; Weakly similarto S1 64 [H.sapiens 4.2 

110860 N33438 Hs.1 70065 ESTs 12.5 

110897 N39148 Hs.6880 DKFZP434D1 56 protein 4 

55 110915 N46252 Hs.29724 ESTs 23.2 

110935 N48787 Hs.305979 protease inhibitor 1 (anti-elastase); alpha- 4.8 

110970 N51374 Hs.96870 Homo sapiens mRNA full length insert cD 5.4 

111006 N53375 Hs.166146 Homer; neuronal immediate early gene; 3 4.7 

111008 N53388 Hs.7222 ESTs 13.3 

60 111018 N54067 Hs.3628 mitogen-activated protein kinase kinase ki 5.7 

111084 N59543 Hs.15456 PDZ domain containing 1 8.3 

111100 N62522 Hs.20450 ESTs 14.3 

111125 N63823 Hs.269115 ESTs 7.9 

111132 N64378 Hs.83293 ESTs; Highly similarto unknown function 4.4 

65 111139 N64683 Hs.290943 ESTs 6 

111164 N66857 Hs.14808 ESTs; Weakly similarto!!!! ALU CLASS 4.1 

111172 N67102 Hs.21851 Homo sapiens mRNA; cDNA DKFZp586 5.5 
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IN 04100 


nS.4o9oo 


rOTn 

to IS 


-HQ/100 


Nbbloo 


(J. ~?AGAC\ 

HS./4o4y 


colS 


1104/TI 


INbb/by 


(is.^yl Uoo 


ESTs 


44 0/170 


iNobolo 


HS.4zl /y 


CblS 


11(54/0 


NOOo40 




CCTo. U/nnHu .imIU. i. nil A 1 t I r*t A PP 

cols; weaKly similar to !!!! ALU OLAoo 


1 i04yo 


MA71AG 
INOf I4y 


ns.ou I io 


CO IS 


4 

lloozo 


Nb/ooy 




ESTs 


1 lftE/19 


NboU 1U 


nS.4y4z/ 


colS 


•1 -1 QC/Vl 

l lobuu 






ESTs 


i looyo 


IN / 1 /O i 


nS.OUUol 


Homo sapiens mRNA full length insert cD 


1 lobyo 




nS.OUloY 


cbTS 




NIQH71Q 
iy 




CCTe 
CO IS 


1 1P.QG.9 


pjy^yoo 




to is, nigniy similar to ubi-yu protein in 


1 1P.Q7R 

i ioy/o 


NQ^fiOQ 


Uc Q??Q1 


col S 


1 1 oyob 


Ny4Jbi 


MS.lZoooU 


colS 


1 1QQQQ 


Ny44jy 


MS.401UO 


colS 


119027 


N99256 


ns. n 40 n 


CCTe 

col S 


119042 


R05316 


nS.04/z 


COT*. 

col S 


119075 


R36451 


Ue "5fl7flOrt 


fib rone din 1 


119260 


T15916 


no. i utc/ju 


co i s, niyniy suniidr 10 coal proiein gamm 


119271 


T16387 




CCTe 
CO 1 S 


119298 


T23820 


Ue ^ RK>l7fl 


cyclin T2 


119302 


T25725 




colS 


119341 


T62571 


Uc 

MS.140JOO 


microtu bule-associated protein 7 


119495 


W35390 


He cccoq 


CQTc 

colS 


119580 


W42451 


ns.y^zou 


high-mobility group protein 2-like 1 


119602 


W46286 


MS.zoao94 


EbTs; Weakly similar to ZK1058.5 [C.ele 


119620 


W47620 


Hs.56009 


2'-5'oligoadenylate synthetase 3 


119676 


W60473 


U« CT7Q7 


ColS 


119717 


W69134 


Uc A7QS7 

ns.o/yo/ 


cols 


119729 W69747 


nS.y4oUo 


KlAAlUb/ protein 


119805 


W73788 


1 I _ A OOH O 

Hs.43213 


ESTs 


119859 


W80702 


nS.0o4b1 


ColS 


119867 


W80852 


Hs.250696 


KDEL (Lys-Asp-6lu-Leu) endoplasmic re 


119873 


W81129 


Hs.44865 


Homo sapiens mRNA; cDNA DKFZp586 


119899 


W84767 


Hs.58698 


ESTs 


119940 


W86779 


Hs.272531 


DKFZP586B0319 protein 


119943 


W86835 


Hs.14158 


copine III 


119970 


W87812 


Hs.93581 


Homo sapiens mRNA; cDNA DKFZp586 


120131 


Z38656 


Hs.75887 


coatomer protein complex; subunit alpha 


120150 


Z39549 


Hs.153746 


ESTs 


120206 


Z40805 


Hs.91668 


ESTs 


120241 


Z41815 


Hs.65946 


ESTs 


120255 


AA169752 


Hs.5672 


ESTs; Weakly similar to Similarity to Yea 


120314 AA194166 


Hs.221040 


KIAA1038 protein 


120325 


AA195651 


Hs.104106 


ESTs 


120352 AA211400 


Hs.193172 


ESTs 
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10 



15 



cao 



-25 



135 



40 



45 



50 



55 



60 



65 



120428 

120524 

120528 

120571 

120649 

120655 

120668 

120712 

120713 

120724 

120873 

120885 

120919 

120948 

120969 

120977 

121103 

121291 

121320 

121463 

121596 

121723 

121748 

122125 

122522 

122655 

122704 

122782 

122856 

122882 

122928 

122974 

122997 

123016 

123107 

123111 

123114 

123136 

123137 

123169 

123176 

123338 

123436 

123442 

123449 

123494 

123503 

123533 

123619 

123673 

123729 

123819 

123960 

124000 

124006 

124012 

124021 

124049 

124059 

124243 

124308 

124314 

124315 

124350 

124352 

124357 

124390 



AA236822 

AA261852 

AA262107 

AA280738 

M287115 

AA287347 

AA287833 

AA292654 

M292655 

M293470 

M358015 

M365515 

AA381125 

AA397822 

AA398116 

AA398155 

AA398936 

AA401753 

AA403008 

M411745 

AA4 16740 

M419622 

M421171 

AA434411 

AA449444 

AA454756 

AA456326 

AA459894 

M463740 

AA465381 

AA476578 

AA478625 

AA479295 

AA480103 

AA486071 

AA486273 

AA486407 

M487449 

AA487468 

AA488892 

M489020 

AA504249 

AA598714 

AA598803 

AA598899 

AA599786 

AA600121 

AA608751 

AA609200 

AA609471 

AA609778 

AA620636 

AA621785 

D57317 

D60302 

D80240 

F02859 

F10523 

F13673 

H66710 

H93575 

H94877 

H94892 

N21359 

N21626 

N22401 

N29325 



Hs.173694 

Hs.192905 

Hs.104413 

Hs.34892 

Hs.1 92843 

Hs.238205 

Hs.292913 

Hs.102506 

Hs.96557 

Hs.100747 

Hs.301872 

Hs.301444 

Hs.104650 

Hs.129206 

Hs.97600 

Hs.97697 

Hs.8186 

Hs.301927 

Hs.239681 

Hs.174104 

Hs.1 04800 

Hs.234545 

Hs.98806 

Hs.98969 

Hs.97837 

Hs.99445 

Hs.99472 

Hs.75367 

Hs.108812 

Hs.1 01 840 

Hs 194215 

Hs.1 06290 

Hs.323231 

Hs.104207 

Hs.191721 

Hs.1 29928 

Hs. 194024 

Hs.1 00686 

Hs.69233 

Hs.187585 

Hs.223014 

Hs.1 11496 

Hs.1 12493 

Hs.112110 

Hs.293156 



Hs.158549 

Hs.278672 

Hs.1 12264 

Hs.287733 

Hs.74861 

Hs.270016 

Hs.241471 

Hs.13974 

Hs.74519 

Hs.283713 

Hs.1 33525 

Hs.241507 

Hs.215766 

Hs.288757 

Hs.101282 

Hs.102406 

Hs.7535 



KIAA1097 protein 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

eukatyotic translation initiation factor 2 al 

ESTs 

ESTs 

EST 

ESTs; Moderately similar to !!!! ALU SU 
ESTs 

ESTs; Highly similar to similar to mago n 

casein kinase 1; gamma 3 

ESTs 

EST 

lung cancer candidate 

T-cell receptor; alpha (V;D;J;C) 

ESTs; Weakly similar to KIAA0554 prate 

ESTs 

ESTs; Weakly similar to Mouse 19.5 mRN 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Src-like-adapter 

ESTs; Weakly similar to B0041.5 [C.eleg 

ESTs 

ESTs 

Kelch motif containing protein 

ESTs; Weakly similar to alternatively spli 

ESTs 

ESTs 

ESTs; Moderately similar to KIAA0454 p 
ESTs 

ESTs; Weakly similar to secreted cement 
ESTs; Weakly similar to Gag-Pol polypro 
ESTs 
ESTs 

protease; serine; 15 
ESTs 

Homo sapiens mRNA; cDNA DKFZp564 

ESTs 

ESTs 

ESTs; Weakly similar to !!!! ALU SUBFA 

ESTs 

ESTs 

membrane component; chromosome 11; s 
ESTs 

methylrnalonate-semialdehyde dehydroge 
activated RNA polymerase II transcription 
ESTs 

HUM5G11A Human fetal brain (TFujiwa 
ESTs 

primase; polypeptide 2A (58kD) 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564 
GTP-binding protein 

v-ral simian leukemia viral oncogene horn 
Homo sapiens mRNA; cDNA DKFZp434 
ESTs 

yw37g07.s1 Morton Fetal Cochlea Homo 
ESTs; Highly similar to COBW-like place 



5.6 

5.6 

4.5 

4.9 

4.5 

6.7 

8.3 

4.6 

10.6 

5.4 

7.1 

4.6 

8.2 

8.6 

10.5 

10.9 

7.4 

5.3 

13.5 

8.9 

22.6 

8 

5.6 
5.3 
4 
4 

6.2 

5.3 

13.1 

5.5 

6.3 

6 

12.5 

4.4 

8.3 

4.2 

5.2 

4.2 

14.6 

4.5 

5.2 

4 

7.3 
5.9 
4.1 
4 

12.8 

7.9 

23.1 

6.6 

4.7 

4 

7.6 

4.4 

20.6 

6.7 

4.7 

4.7 

7.7 

5.5 

11.4 

13.7 

14 

8.6 

7.2 

5.2 

7.9 
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124438 


N40188 


Hs.11090 


ESTs 


9.5 




124447 


N48000 




Homo sapiens mRNA; cDNA DKFZp586 


4.8 




124457 


N50114 


Hs.266175 


ESTs 


6.1 




124539 


N63172 


Hs 146409 


rpll riix/isinn rvnlp 49 fRTP-hindino nrntp 


5.6 


5 


124626 


N74604 


Hs.11090 


ESTs 


12.8 




124632 




Hs 106117 


infprif^iikin 1^ rpf^pntnr alnha 1 


6.4 




124644 


N91279 


hk 109(554 
no. twujt 


FST^* Moripratplv similar in outpr mpmhr 

Lu i o, i vkajgi aici y oiiiiiicii i\j uuici iiiciiiui 


8.3 




124676 


Rfl1fH7 

r\v 1 uj/ 


Hs 181011 
no. loiuiv 


nhnenhtvilvr^pratp mnta^-P 1 /Krain^ 
fjiiuofji luyiy wCiciic iiiuiooc i \u\a\wj 


12.3 


10 


124677 


I\V 1 V / O 




F^Tq* Weaklv similar to Mlf Al II CLASS 

CO 1 O) V V CQMy OlMIHCIi l\J lill rtl_U ULnOQ 


5.4 


124724 


R12405 


119491 


Homo canipoc mRMA" rDNIA nKF7n^fifi 


6.6 




1 74771 

1 tri t I O 


R4f)Q91 


He 1 flfififU 

no. lUOOU*r 


CO 1 o 


4.9 




1 94777 


rvt 1 300 




CO 1 O 


7.2 




1 24792 


R44357 


Hs 48712 


ESTs' Weaklv similar to cDNA EST EMB 

1 — 1 U| V » ^CJIM J Oil! IllCtl ViJ vUl l/ \ L_VJ 1 L.IVILJ 


8.6 




124857 


R63652 


Hs 137190 


ESTs 


4.9 


194Q11 


RSRQQ9 


no. 1 ouu 1 £. 


CO I o 


4 7 


V. a 


1 94Qfi , i 
1 3 J J 


1 IVJjO 


He ^94R41 

n&.Ot'rtrr I 


F^Te- Wpflklv similar tn till AI 1 1 91 IRFA 


4 4 






T11114 
1 1 1 tot 


H* 4^1 


UIUI II ic icUr.cn lid viidi \u\\li- i } Vi lOUycl ic II 


19 fi 




1 4.UUOO 


T7fiORQ 


He 970 1 **4 
r lo.z/ u i wH 


ESTs 


4.1 


S ; 


125092 


T92544 


H<! I^^R 


Cnft4 airtinpn ^Ipi iknrvtp antinpnl 


14.8 


wo 


I £3 I Ot 


\A/1*\4Q5 
VV I3*ri73 


He 19Q781 
no. O I 


ulllUMluoUllIc ^ I upcll icaUifiy lldlilc -J 


fi 7 

Q.I 




125144 


\Att7QQQ 

VVOf C7C7C3 


He 94^fi 


CO t o 


4 R 




1W54 

1 tU 1 


1/1/00410 




CO 1 O 


3.0 




1 95943 


Wftfi493 


He 1ft^41^ 
no, 1 UCJn 1 0 


CO I 0 


fi fi 


: 


1 9597Q 




He 477Q 


ccTq- M/vHpra+plu cimilar tn cimilar tn AH 

COI Oj ivlUVJclGLCf J OIIIIIICII Ollllllal IU ML/ 


5.8 


45 


1 9R9QQ 




He 10.9790. 
no, \ \J£.I z\) 


CO 1 O 


19 9 

I L.C 




1 9s^n^ 

liUOUO 




He 9AA1Q^ 


CO 1 o 


10 9 


5 


19*; 004 

1 WOVH 


£.03000 


He 194Q4H 

no. I£.*t*7*tU 


^TP_hinHinn nrnfpin 

\J I r "UlllUHty LrlUlCIII 


fi ft 




195474 


MM 1 3 IZ IO 


He 7*\1f»^ 
no. / O IUO 


Lyiuoiiic o-inuuuuAyyciiaoc(iiyptU[Jiiciii 


fi 
o 






flfi0.449*}9 


He 9flAQfi7 


FQTe 
COI o 


■i 4 


Ho 


19*; win 


MM liDJUH 


He 9fi7fti 9 


enrtinn npvin 4 

OUI Ul ly 1 IGAII 1 t 


4.1 






MMOU/ OOO 


He 74fi4Q 

no, / *rv*ri? 


wjlUvlllUlIK; O UAlUdOC oUUUI III V IL> 


11 <5 


*Si 
4- 


1 95fi7fl 


AI419R91 


He ft9fift^ 


KsUHJ ctiiLiy el I \rsi I'lcialcU al luyci 1, ill icy 1 1 


4 


f "i 


1 95fiQfi 


AA7484R? 

MM/ *TO*tQO 


He IQI^Sfi 
no. 1 7 1 ouu 


npnpral tran<5/*rinttrtn faptnr IIH* nnlvnpnti 


9.4 






Mli0OH3O 


He 7^799 

no. / J>/ LL 


rihnnhnrin II 


fi? 


is 


1 9*5ftS9 


HflQ9Qfl 


He 7fifi^n 
no. / uuuu 


Hnmn eanipne mRNA 1 rHWA nKF7n5fi4 

nulllU odJJICllD IHrxllIM, wL^INM LT\ri.p>Jlrr 




125972 


AA434562 


1 IOiWWTvU 


ESTs 


4.1 




126160 




He 9fi*i^Qft 


FRTq* Wpnklv Qimilnr tn tranQfnrmatinn-r 

COId, VVCOIMy OIIIIIICII tu U Cll IdlUI 1 1 ICUIUI II 


1R4 






IW3UOQ 


He 1 940R4 
no. itTvot 


fnmnr nprrnQi^ fartnr rprpnfnr ci inprfamilv 
uiiiiui iicuiuoio laviui icvcpiui oupci idiuiiy 


a. J 


40 


19RT37 


Alflfifi4G.fi 


He 4fVifin 


cimilar tn Q r*ora>i/iciao RFR1 
oiiiiiicii lu o. ucievioidc r\cr\ i 




19fi4fl'i 


1 I4fi97ft 


He 1994AQ 
no. 1 c^rrO^j 


CO I o 


7 "i 






W409R9 


He 14fi^m 
no. i tuJ iu 


Lois, vvcdMy animal iu uuiauvt? u isu 


4 1 
1. 1 




1 9fifiQfl 


VV / 0900 


He 1fl1^n7 
Do. IO loU/ 


hictnna 1 familt/ 

no niSiOne, Tamiiy ort 


4 <; 




1 9R71 9 


AA90RRK9 


He 7Q49 
no, / 


CO 1 o 


<i 9 


45 


19(5791 


T795RQ 


He 19^^Q 
no. IZOoOi? 


i ny- 1 ceil sunace anngen 


4 4 


19(57(54 


Ml 00HO3O 


He 10917ft 
no. 1 Ux 1 / 0 


Co 1 5 


4 fi 




1 9(5R(14 

JZDOU4 


fll9fVW*4 
MIZUOOOf 


He 1fiflfi9ft 
no, lOUOtO 


FQTc 
col 5 


11 7 
1 1./ 




19(5R1Q 
I ZOO I q 


a Aonccoc 
MMOUOOOO 


He 970fiH7 

no. £.1 you/ 


FQTc 
CO I o 


A 




1 9fiA77 


a in*;9fiA7 

AIU3AH7 


We 9fi109 

ns.zoiuz 


FQTc 
col S 


7 
1 


50 


1 9fiQQ1 


D-54CCO 

Kol 00Z 


Me fl91 


biglycan 


5 fi 
0.0 


1 97470 


AA51T799 
MMO 10/ el. 


He 17Q79Q 

no. 1 /i7/ £3 


nnllanon - tuna Y* alnha 1 /QnhmiH matanh 
UUIIdycll, lypt? A, dipild I \ol/MlillU lllcuapil 


14 ? 

I*T.0 




197114 


A A.00fiQ9R 

MMozoyzo 


He 9A4914 


FQTc 
col S 


4 <\ 




1 97fifi1 


WU/ -COO 




ESTs; Weakly similar to weak similarity t 


fi 1 
0. 1 




127677 


AA9 16752 


Hs 264190 


ESTs; Highly similar to MEM3 [M.muscu 


17.3 


55 


127814 


AA761755 


Hs.136713 


ESTs; Weakly similar to V4-1 [H.sapiens 


4,1 


127997 


AI281549 


Hs.311054 


ESTs 


5.5 




128092 


AA904617 


Hs. 166229 


ESTs 


5.8 




128218 


H02682 


Hs.292154 


ESTs; Moderately similar to recombinatio 


5.8 




128466 


D59653 


Hs.241471 


EST 


7.4 


60 


128482 


U83908 


Hs.296251 


programmed cell dealh 4 


5.8 


128517 


AA280617 


Hs. 100861 


ESTs; Weakly similar to p60 katanin [H.s 


8.3 




128530 


AA504343 


Hs.183475 


Homo sapiens clone 25061 mRNA sequen 


6.6 




128559 


AA226801 


Hs. 101 448 


metastasis associated 1 


5.2 




128574 


AA412048 


Hs.38260 


keratin 8 


5,1 


65 


128595 


U31875 


Hs. 152677 


short-chain alcohol dehydrogenase family 


27.1 


128610 


L38608 


Hs. 10247 


activated leucocyte cell adhesion molecule 


13.2 




128629 


AA399187 


Hs. 102708 


DKFZP434A043 protein 


6.7 




128649 


AA142853 


Hs.103106 


Homo sapiens mRNA for G7b protein (G 


4.5 
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128651 M446990 Hs.103135 

128653 R48943 Hs.10315 

128656 AA458542 Hs.10326 

128717 T30617 Hs.104222 

128727 M64174 Hs.50651 

128764 N49308 Hs.104938 

128781 X85372 Hs.105465 

128793 W93562 Hs.105749 

128835 W15528 Hs.106390 

128845 M455658 Hs.10649 

128871 AA400271 Hs.1 06778 

128922 AA252023 Hs.9589 

128925 D61676 Hs.21851 

128938 AA410325 Hs.107260 

128946 N29353 Hs.107318 

128948 AA485655 Hs.223025 

128955 F10290 Hs.185807 

129005 AA460049 Hs.13323 

129009 AA131421 Hs.75607 

129017 H13108 Hs.107968 

129057 X62466 Hs.276770 

129075 AA129465 Hs.83765 

129095 L12350 Hs.108623 

129124 AA234530 Hs.108802 

129160 M131252 Hs.109007 

129164 AA282183 Hs.109045 

129180 R40556 Hs.318401 

129224 X89109 Hs.109606 

129229 AA211941 Hs.109643 

129240 W24360 Hs.237868 

129241 AA435665 Hs.109706 
129243 H88033 Hs.109727 
129247 AA151574 Hs.109733 
129259 AA090695 Hs.181385 
129270 Z35227 Hs.109918 
129281 AA026318 Hs.289101 
129300 C20976 Hs.110165 

129318 N93155 Hs.285976 

129319 AA037467 Hs.30340 
129351 AA167268 Hs.62349 
129366 H18027 Hs.184697 
129383 W92984 Hs.288224 
129388 M151621 Hs.1 10964 
129391 T80814 Hs.1 1101 
129404 M172056 Hs.317584 
129406 N23707 Hs.111138 
129426 M412087 Hs.111323 
129453 AA421213 Hs.111632 
129513 C00225 Hs.306163 
129519 AA298786 Hs.112242 
129606 R21443 Hs.166254 
129622 AA278243 Hs.323949 

129626 AA447410 Hs.111334 

129627 AA258308 Hs.71968 

129628 U26727 Hs.1 174 
129642 R50008 Hs.1 1806 
129663 AA442768 Hs.1 1866 
129665 M88458 Hs.1 18778 
129691 X06700 Hs.1 19571 
129783 AA454618 Hs.12479 
129800 AA252436 Hs.12540 
129836 AA452161 Hs.206521 
129850 N20593 Hs.288932 
129869 AA102520 Hs.13015 
129896 AA043021 Hs.1 3225 
129982 M87789 

129985 AA450045 Hs.140452 



ESTs 

solute carrier family 7 (cationic amino aci 
coatomer protein complex; submit epsilon 
Homo sapiens mRNA; cDNA DKFZp566 
Janus kinase 1 (a protein tyrosine kinase) 
ESTs; Weakly similar to alpha 1(XVIII) c 
small nuclear ribonucleoprotein polypepti 
KIAA0553 protein 

Homo sapiens mRNA; cDNA DKFZp586 
basement membrane-induced gene 
Homo sapiens mRNA for putative Ca2-M 
ESTs; Weakly similar to HRIHFB2157 [H 
Homo sapiens mRNA; cDNA DKFZp586 
ESTs 

kynurenine 3-monooxygenase (kynurenin 
proteasome (prosome; macropain) subunit 
Homo sapiens clone 24758 mRNA sequen 
ESTs; Weakly similar to SODIUM- AND 
ESTs 
ESTs 

CDW52 antigen (CAMPATH-1 antigen) 
ESTs 

thrombospondin 2 
N-ethylmaleimide-sensitive factor 
ESTs 
ESTs 

ESTs; Highly similar to HSPC039 protein 

coronin; actin-binding protein; 1A 

polyadenylate binding protein-interacting 

interleukin 7 receptor 

ESTs; Moderately similar to HN1 [M.mus 

KIAA0733 protein 

pilin-like transcription factor 

ESTs 

ras homolog gene family; member H 
glucose regulated protein; 58kD 
ESTs; Highly similar to ribosomal protein 
calmodulin 1 (phosphorylase kinase; delta 
ESTs 

Human ras inhibitor mRNA; 3' end 

plexin C1 

ESTs 

ESTs 

discs; large (Drosophila) homolog 3 (neur 
ESTs 

KIAA0712 gene product 

EST; Highly similar to protein inhibitor o 

Lsm3 protein 

ESTs; Weakly similar to fos39554 1 [H.s 
ESTs 

heat shock 90kD protein 1; alpha 
ESTs 

ESTs; Weakly similar to !!!! ALU SUBFA 
Homo sapiens mRNA; cDNA DKFZp564 
cyclin-dependent kinase inhibitor 2A (mel 
7-dehydrocholesterol reductase 
translocase of inner mitochondrial membr 
KDEL (Lys-Asp-Glu-Leu) endoplasmic re 
collagen; type III; alpha 1 (Ehlers-Danlos 
associated molecule with the SH3 domain 
lysophospholipase I 
YME1 (S.cerevisiae)-like 1 
GDP dissociation inhibitor 2 
ESTs; Weakly similar to heat shock prate 
UDP-Gal:betaGlcNAc beta 1 ;4- galactosy 
immunoglobulin gamma 3 (Gm marker) 
cargo selection protein (mannose 6 phosp 



230 



Attorney Docket No.:018501-005210US 



130029 AA236412 Hs.236510 ESTs; Moderately similar to PFT27 [M.m 

130033 M90696 Hs.181301 cathepsinS 

130036 AA195260 Hs.125849 ESTs; Moderately similar to !!!! ALU SU 

130069 AA055896 Hs.146428 collagen; type V; alpha 1 

5 130077 T24055 Hs.91379 ribosomal protein L26 

130080 X14850 Hs. 147097 H2Ahistone family; member X 

130096 AA223874 Hs.197955 KIAA0704 protein 

130114 AA234717 Hs.14992 ESTs 

130125 M36803 Hs.1504 hemopexin 

10 130135 M61764 Hs.21635 tubulin; gamma 1 

130170 AA610070 Hs.151469 calcium/calmodulin-dependent serine pro 

130189 D43947 Hs.151761 KIAA0100 gene product 

130208 AA620556 Hs. 15250 peroxisomal D3;D2-enoyl-CoA isomerase 

130211 D50840 Hs.23703 UDP-glucose ceramide glucosyltransferas 

15 130235 X14046 Hs.153053 CD37 antigen 

130276 S75295 Hs.169149 karyopherin alpha 1 (importin alpha 5) 

y, 130280 L13738 Hs.153937 activated p21cdc42Hs kinase 

130313 AA620323 Hs,154320 ubiquitin-activating enzyme E1C(homolo 

130314 D86967 Hs. 154332 KIAA0212 gene product 
120 130328 AA135673 Hs.154668 KIAA0391 gene product 

iT-i 130356 X84373 Hs.155017 nuclear receptor interacting protein 1 

TfV 130367 Z38501 Hs.8768 ESTs; Weakly similar to !!!! ALU SUBFA 

W 130378 T47333 Hs.155188 TATA box binding protein (TBP)-associa 

Hi 130384 X66364 Hs. 166071 cyclin-dependent kinase 5 

125 130393 D13630 Hs.155291 KIAA0005 gene product 

~j! 130399 AA449417 Hs.155356 Homo sapiens mRNA for putative glucosy 

P 130407 N29888 Hs.1 55410 ESTs 

130414 M21121 Hs.241392 small inducible cytokine A5 (RANTES) 

130417 U58522 Hs.155485 huntingtin-interacting protein 2 

*30 130421 D21260 Hs.178710 clathrin; heavy polypeptide-like 2 

M= 130441 U35835 Hs.155637 protein kinase; DNA-activated; catalytic p 

h\ 130455 X17059 Hs.155956 N-acetyltransferase 1 (arylamine N-acetyl 

V Z 130498 L38951 Hs. 180446 karyopherin (importin) beta 1 

4" 130499 AA416723 Hs. 158286 Homo sapiens mRNA for KIAA0446 prot 

f3;5 130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pseud 

S i 130553 AA430032 Hs.252587 pituitary tumor-transforming 1 

l - 130558 H96654 Hs.15984 ESTs; Weakly similar to gene pp21 protei 

130568 AA232535 Hs.16085 ESTs; Highly similar to CGI-13 protein [H 

130583 W24957 Hs.293907 ESTs; Moderately similar to similar to C.e 

40 130585 H66211 Hs.16331 ESTs 

130604 X03635 Hs.1657 estrogen receptor 1 

130614 AA132007 Hs.16697 ESTs 

130619 AA477739 Hs.12532 ESTs 

130622 AA235247 Hs.16846 ESTs; Weakly similar to cytochrome P45 

45 130625 F03969 Hs.260720 matrix metalloproteinase 2 (gelatinase A; 

130627 L23808 Hs.1695 matrix metalloproteinase 12 (macrophage 

130629 M60346 Hs.1697 ATPase; H+ transporting; lysosomal (vacu 

130635 M87503 Hs.1706 interferon-stimulated transcription factor 

130639 D59711 Hs.17132 ESTs 

50 130677 H17861 Hs.17767 ESTs 

130681 D82808 Hs.17820 Rho-associated; coiled-coil containing pro 

130693 AA487202 Hs.17962 ESTs 

130703 N63295 Hs.18103 ESTs 

130706 AA488843 Hs.201673 comichon-like 

55 130712 AA292066 Hs.279762 adenylate cyclase 7 

130714 X92896 Hs.18212 DNA segment on chromosome X (unique) 

130715 T98227 Hs.171952 occludin 

130744 AA203527 Hs.18747 POP7 (processing of precursor; S. cerevis 

130747 AA471293 Hs.6879 ESTs 

60 130751 AA435633 Hs.18879 Homo sapiens clone 23965 mRNA sequen 

130796 R39390 Hs.19525 ESTs 

130800 AA223386 Hs.19574 ESTs; Weakly similar to katanin p80 subu 

130855 AA425439 Hs.143323 putative DNA/chromatjn binding motif 

130859 AA287327 Hs.20478 ceroid-lipofuscinosis; neuronal 2; late infa 

65 130866 M58028 Hs.2055 ubiquitin-activating enzyme E1 (A1S9T a 

130880 D14678 Hs.20830 kinesin-like 2 

130891 D31891 Hs.20991 SET domain; bifurcated; 1 
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130905 AA056489 Hs.1 29998 ESTs 

130913 W03592 Hs.21198 translocase of outer mitochondrial membr 

130919 AA291710 Hs.21276 collagen; type IV; alpha 3 (Goodpasture a 

130921 AA074596 Hs.1 94688 bromodomain adjacent to zinc finger dom 

5 130944 M97935 Hs.21486 signal transducer and activator of transcrip 

130974 X57985 Hs.2178 H2B histone family; member Q 

130987 R45698 Hs.21893 ESTs; Weakly similar to cAMP inducible 

130999 N48963 Hs.21992 KIAA0689 protein 

131010 AA435748 Hs.169341 ESTs; Weakly similar to phosphatide acid 

10 131046 X02530 Hs.2248 small inducible cytokine subfamily B (Cy 

131091 T35341 Hs.22880 ESTs; Highly similar to dipeptidyl peptid 

131153 H11760 Hs.23606 ESTs 

131185 M25753 Hs.23960 cyclin B1 

131200 AA609427 Hs.293732 ESTs; Moderately similar to !!!! ALU SU 

15 131206 AA044078 Hs.24210 ESTs 

=* 131210 AA430047 Hs.95549 ESTs 

131227 AA429472 Hs.236522 DKFZP434P1 06 protein 

131244 D38076 Hs.24763 RAN binding protein 1 

131245 AA620599 Hs.24766 DKFZP564E1 962 protein 
|.30 131257 AA256042 Hs.24908 ESTs 

m 131319 U25997 Hs.25590 stanniocalcin 

J?: 131339 AA463450 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 

IV 131388 R34531 Hs.92200 KIAA0480 gene product 

%A 131410 H84658 Hs.279836 ESTs 

3=25 131472 AA608962 Hs.27258 calcyclin binding protein 

*=' : 131475 Z39053 Hs.27263 ESTs 

£ 131501 AA121127 Hs.8207 H3 histone; family 3A 

Pi 131514 X02152 Hs.2795 lactate dehydrogenase A 

131524 N39152 Hs.301804 ESTs 

[30 131528 D60856 Hs.28309 UDP-glucose dehydrogenase 

nJ 131544 N33236 Hs.28555 ESTs; Weakly similar to B051 1.8 [C.eleg 

r : 131557 D30946 Hs.28707 signal sequence recepton gamma (transloc 

T 131562 U90551 Hs.28777 H2A histone family; member L 

G=l 131564 AA491465 Hs.28792 ESTs 

fQ5 131586 AA235385 Hs.26966 ESTs; Moderately similar to alternatively 

131587 M15182 Hs.183868 glucuronidase; beta 

131589 U52100 Hs.29191 epithelial membrane protein 2 

131615 D14533 Hs.192803 xeroderma pigmentosum; complementatio 

131664 AA136126 Hs.30327 mitogen-activated protein kinase-activated 

40 131679 AA136660 Hs.30579 ESTs 

131684 U26174 Hs.3066 granzyme K (serine protease; granzyme 3 

131687 L11066 Hs.3069 heat shock 70kD protein 9B (mortalin-2) 

131689 AA599653 Hs.30696 transcription factor-like 5 (basic helix-loo 

131693 W60913 Hs.1 10796 ESTs; Weakly similar to cDNA EST yk45 

45 131710 AA233225 Hs.30985 MRS1 protein 

131716 D49738 Hs.31053 cytoskeleton-associated protein 1 

131742 D31352 Hs.31433 ESTs 

131762 H46831 Hs.107767 ESTs; Moderately similar to CaM-KII inh 

131781 AA460450 Hs.31989 DKFZP586G1 722 protein 

50 131795 N32724 Hs.32317 Sox-like transcriptional factor 

131809 L76517 Hs.3260 presenilin 1 (Alzheimer disease 3) 

131814 AA437226 Hs.157 interleukin 10 receptor; alpha 

131838 AA091932 Hs.180628 dynamin-like protein 

131877 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 

55 131885 AA044095 Hs.3402 ESTs 

131891 AA1 58258 Hs.30376 heterogeneous nuclear protein similar to r 

131925 AA248470 Hs.183180 ESTs; Weakly similar to RING finger pro 

131930 AA205460 Hs.69476 ESTs 

131941 D62657 Hs.35086 ubiquitin-specific protease 1 

60 131965 W90146 Hs.35962 ESTs 

131970 D86960 Hs.3610 KIAA0205 gene product 

131971 R70167 Hs.154938 ESTs 

131974 AA410424 Hs.268122 Homo sapiens mRNA; cDNA DKFZp586 

131977 F09788 Hs.3622 procollagen-proline; 2-oxoglutarate 4-diox 

65 131994 AA479515 Hs.279882 Human DNA sequence from clone 703H1 

131997 D82399 Hs.136644 Homo sapiens clone 23714 mRNA sequen 

132017 W67251 Hs.267659 Homo sapiens vav 3 oncogene (VAV3) m 
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ESTsi Moderately similar to neuronal thre 
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132856 


W79865 


Hs.58367 


glypican 4 


6.2 


132869 


N26855 


Hs.203961 


ESTs 


6.5 


132874 


AA425776 


Hs.58609 


ESTs 


5.6 


132880 


AA444369 


Hs.177537 


ESTs 


7.2 


132894 


D82422 


Hs.5944 


ESTs 


7.5 


132900 


N56451 


Hs.5978 


LIM domain only 7 


4.4 


132903 


AA235404 


Hs.5985 


Homo sapiens clone 25186 mRNA sequen 


9.1 


132904 


X83618 


Hs.59889 


3-hydroxy-3-methylglutaryl-Coenzyme A 


10.7 


132906 


AA142857 


Hs.234896 


ESTs; Highly similar to geminin [H.sapie 


10.2 


132914 


AA496037 


Hs.60293 


ESTs 


4.7 


132918 


AA252605 


Hs.6051 


KIAA0616 protein 


7.1 


132936 


AB002305 


Hs.6111 


KIAA0307 gene product 


8.3 


132951 


U04209 


Hs.61418 


microfibrillar-associated protein 1 


4.3 


132957 


AA234791 


Hs.61469 


Human gene from PAC 753P9; chromoso 


13.2 
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132959 AA028103 Hs.61472 

132968 N77151 Hs.61638 

132984 H80409 Hs.62112 

132990 AA458761 Hs. 18387 

132994 AA505133 Hs.279905 

132998 Y00062 Hs.170121 

133002 AF006082 Hs.42915 

133005 C21400 Hs.278605 

133015 AA047036 Hs.246315 

133016 W81298 Hs.6289 
133039 X62055 Hs.63489 
133050 S67325 Hs.63788 
133056 AA071387 Hs.6396 
133062 R33663 Hs.64056 
133083 N70633 Hs.6456 
133091 M122147 Hs.64691 
133093 AA598749 Hs.285996 
133124 AA156049 Hs.267923 
133126 D16469 Hs.6551 
133196 R37367 Hs.6727 
133214 Y10659 Hs.285115 
133225 Z41415 Hs.6823 
133228 N90029 Hs.6831 

133239 AA059405 Hs. 179882 

133240 D31161 Hs.242894 
133257 AF006086 Hs.6895 
133264 W72187 Hs.69192 
133274 AA488886 Hs.6949 
133281 AA421079 Hs.69594 
133283 AA410507 Hs.6968 
133287 L15702 Hs.69771 
133294 R79723 Hs.69997 
133297 AA600057 Hs.70266 
133318 AA256168 Hs.152316 
133362 H06195 Hs.7194 
133370 AA156897 Hs.72157 
133391 X57579 Hs.727 
133395 AA491296 Hs.72805 
133422 N79516 Hs.73287 
133431 AA255438 Hs.7358 
133435 T23983 Hs.323966 
133449 AA094989 Hs.7381 
133468 X03068 Hs.73931 
133484 X78710 Hs.211581 
133506 AA316868 Hs.74346 
133517 X52947 Hs.74471 
133551 D63480 Hs.278634 
133569 AA313977 Hs.172772 
133572 W94333 Hs.279915 
133577 F03717 Hs.75063 
133589 L37368 Hs.75104 
133608 D13315 Hs.75207 
133617 AA148318 Hs.75249 
133627 U09587 Hs.75280 

133633 D21262 Hs.75337 

133634 U24166 Hs.234279 
133640 D83004 Hs.75355 
133644 D89077 Hs.75367 
133649 AA479139 Hs.75393 
133652 AA287383 Hs.7540 
133674 AA458946 Hs.75497 
133700 K01396 Hs.297681 
133705 N21648 Hs.75659 
133716 Y00282 Hs.75722 
133720 L27841 Hs.75737 
133752 U49278 Hs.75875 
133765 D21255 Hs.75929 



ESTs; Weakly similar to unknown [S.cere 
myosin X 

zinc finger protein 207 

transcription factor AP-2 alpha (activating 

solute carrier family 2 (facilitated glucose 

protein tyrosine phosphatase; receptor typ 

ARP2 (actin-related protein 2; yeast) horn 

KIAA0970 protein 

ESTs 

growth factor receptor-bound protein 2 
protein tyrosine phosphatase; non-recepto 
propionyl Coenzyme A carboxylase; beta 
jumping translocation breakpoint 
ESTs 

chaperonin containing TCP1; subunit2 (b 

KIAA0483 protein 

ESTs 

ESTs 

ATPase; H+ transporting; lysosomal (vacu 
Ras-GTPase activating protein SH3 doma 
interleukin 13 receptor; alpha 1 
ESTs; Weakly similar to intrinsic factor-B 
Homo sapiens clone 1400 unknown prate 
Homo sapiens clone 24655 mRNA sequen 
ESTs 

actin related protein 2/3 complex; subunit 
ESTs; Weakly similar to cONA EST yk37 
ESTs 

ESTs; Weakly similar to Sox-like transcri 
ESTs 

B-facton properdin 
zinc finger protein 238 
KIAA0905 protein 
ESTs 

ESTs; Highly similar to CGI-59 protein [H 

DKFZP564I1922 protein 

inhibin; beta A (activin A; activin AB alp 

ESTs 

ESTs; Weakly similar to eyelid [D.melano 
Homo sapiens mRNA; cDNA DKFZp566 
ESTs 

voltage-dependent anion channel 3 
major histocompatibility complex; class II 
metal-regulatory transcription factor 1 
ESTs; Weakly similar to 140G11.h [D.me 
gap junction protein; alpha 1; 43kD (conn 
KIAA0146 protein 

transcription elongation factor B (Sill); po 

translocase of inner mitochondrial membr 

human immunodeficiency virus type I enh 

RNA-binding protein SI; serine-rich dom 

glyoxalase I 

KIAA0069 protein 

glycyl-tRNA synthetase 

nucleolar phosphoprotein p130 

microtubule-associated protein; RP/EB fa 

ubiquitin-conjugating enzyme E2N (homo 

Src-like-adapter 

acid phosphatase 1; soluble 

ESTs 

ESTs 

protease inhibitor 1 (anti-elastase); alpha- 
MpV17 transgene; murine homolog; glom 
ribophorin II 
pericentriolar material 1 
ubiquitin-conjugating enzyme E2 variant 
cadherin 1 1 (OB-cadherin; osteoblast) 
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133772 


W73693 


Hs.76038 


isopentenyl-diphosphste delts isomerase 


7.9 


133774 


Z23090 


Hs.76067 


h6at shock 27kD protein 1 


4.1 


133776 


J03473 


Hs. 177766 


ADP-ribosyltransferase (NAD+; poly {AD 
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133784 


AA214305 


Hs.301064 


ESTs 


5.2 


133814 


M33882 


Hs.76391 


myxovirus (influenza) resistance 1 ; homol 


11.7 


133829 


AA453783 


Hs.76550 


Homo sapiens mRNA; cDNA DKFZp564 
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133834 


AA1 47510 


Hs.288660 


serine protease; umbilical endothelium 
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133839 


M59815 


Hs. 170250 


complement component 4A 
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133842 


U73477 


Hs.285013 


putative human HLA class II associated p 
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133845 


T68510 


Hs.76704 


ESTs 
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133859 


U86782 


Hs. 178761 
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ESTs 
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1 33QRn 

I 0030U 
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9.3 


134208 
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no.o touu 
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0./ 


134395 


L09717 


Hs.8262 


lysosomai-associated membrane protein 2 


6.9 


134399 


H99801 


Hs.82689 


tumor rejection antigen (gp96) 1 


4^5 


134401 


AA243746 


Hs.211577 


kinectin 1 (kinesin receptor) 


11.2 


134405 


J04177 


Hs.82772 


collagen; type XI; alpha 1 


15.3 


134415 


AA329274 


Hs.82911 


protein tyrosine phosphatase type IVA; m 


4.1 


134417 


D87969 


Hs.82921 


solute carrier family 35 (CMP-sialic acid t 


4.2 


134419 


L08044 


Hs.82961 


trefoil factor 3 (intestinal) 


5.9 


134421 


AA122386 


Hs.82985 


collagen; type V; alpha 2 


5.8 


134423 


W96151 


Hs.83006 


ESTs; Highly similar to CGI-1 39 protein 


4.4 


134438 


AA449984 


Hs.246857 


ESTs; Highly similar to proteine kinase JN 


7 


134446 


T25732 


Hs.83419 


KIAA0252 protein 


4.6 


134453 


X70683 


Hs.83484 


SRY (sex determining region Y)-box 4 


5.1 


134470 


X54942 


Hs.83758 


CDC28 protein kinase 2 


20.3 


134487 


R38185 


Hs.83954 


Homo sapiens unknown mRNA 


5 


134495 


D63477 


Hs.84087 


KIAA0143 protein 


16.1 
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134498 
134506 
134529 
134570 
134582 
134600 
134623 
134654 
134655 
134675 
134711 
134714 
134722 
134776 
134806 
134810 
134840 
134843 
134853 
134866 
134868 
134885 
134982 
134989 
134992 
134993 
135010 
135015 
135029 
135032 
135037 
135059 
135071 
135083 
135117 
135144 
135154 
135218 
135237 
135243 
135335 
135349 
135367 
135389 
135400 
135411 
300019 
300021 
300022 
300089 
300107 
300254 
300328 
300549 
300711 
300921 
301124 
301165 
301576 
301604 
301704 
301782 
301884 
301936 
302002 
302032 
302067 



M63180 Hs.84131 

U45328 Hs.84285 

H24460 Hs.848 

U66615 Hs.172280 

AA234966 Hs.86041 

R68884 Hs.86347 

X74496 Hs.86978 

W23625 Hs.8739 

AA454070 Hs.123090 

AA250745 Hs.87773 



X04011 

U89922 

W47183 

J05582 

Z49099 

M27394 

U51477 

H60595 

D82348 

U84011 

Z39762 

N27670 

N46086 



Hs.88974 

Hs.890 

Hs.284226 

Hs.89603 

Hs.89718 

Hs.89751 

Hs.89981 

Hs.90061 

Hs.90280 

Hs.904 

Hs.90419 

Hs.9071 

Hs.92308 



AA236324 Hs.92381 

H05625 Hs.5831 

AA282343 Hs.301005 

D59675 Hs.92927 

U54999 Hs.278338 
AA224180 

AA243497 Hs.173685 

U77948 Hs.278589 

AA598449 Hs.93832 

L08069 Hs.94 

AA495950 Hs.94262 

W52493 Hs.94694 

AA044842 Hs.95260 

AA126433 Hs.267812 

D31157 Hs.324277 

AA454930 Hs.9691 

AA215333 Hs.97101 



H20989 
D83174 



Hs.198281 
Hs.9930 



AA480109 Hs.9963 
U05237 Hs.99872 



Hs.99915 
Hs.99947 



M23263 
L10333 
M97935 
M97935 
M97935 

AI199738 Hs.208275 

AI694585 Hs.270464 

AW079607 Hs. 188417 

AW015860 Hs.224623 

AA699328 Hs.298119 

AI492179 Hs.166244 

AW293224 Hs.232165 

T79326 Hs.298262 



N85789 
AI682905 



Hs.150186 
Hs.270431 



AA373124 Hs.24809 
AA526313 Hs.293691 
N99399 Hs.143046 
AA312082 Hs.105445 
NM_004694Hs.1 14924 
AF013956 Hs.5637 
NM_001992Hs.128087 
H05698 Hs.222399 



threonyl-tRNA synthetase 
ubiqurtin-conjugating enzyme E2I (homol 
FK506-binding protein 4 (59kD) 
SWI/SNF related; matrix associated; actin 
CGG triplet repeat binding protein 1 
ESTs; Weakly similar to predicted using G 
prolyl endopeptidase 
ESTs; Weakly similar to ORF YGR200c [ 
ESTs 

protein kinase; cAMP-dependenL" catalyti 
cytochrome b-245; beta polypeptide (chro 
lymphotoxin beta (TNF superfamily; mem 
ESTs; Weakly similar to neural F box pro 
mucin 1 ; transmembrane 
spermine synthase 

membrane-spanning 4-domains; subfamily 
diacylglycerol kinase; zeta (104kD) 
progesterone binding protein 
5-aminoimidazole-4-carboxamide ribonuc 
amylo-1 ;6-glucosidase; 4-alpha-glucanotr 
KIAA0882 protein 

progesterone membrane binding protein 
ESTs 

Homo sapiens mRNA; chromosome 1 spe 
ESTs 

purine-rich element binding protein B 
ESTs 

LGN protein 

ESTs; Moderately similar to 17-beta-hydr 

Human DNA sequence from clone 30M3 

general transcription factor II; i 

Homo sapiens clone 24483 unknown mRN 

heat shock protein; DNAJ-like 2 

ESTs 

Homo sapiens clone 24837 mRNA sequen 
Homo sapiens mRNA; cDNA DKFZp586 
sorting nexin 4 

ESTs; Weakly similar to growth factor-res 
ESTs 

putative G protein-coupled receptor 
pyruvate kinase; muscle 
collagen-binding protein 2 (colligen 2) 
TYRO protein tyrosine kinase binding pro 
fetal Alzheimer antigen 
androgen receptor (dihydrotestosterone re 
reticulon 1 

AFFX control: STAT1 

AFFX control: STAT1 

AFFX control: STAT1 

ESTs; Weakly similar to III! ALU CLASS 

ESTs; Weakly similar to IN! ALU CLASS 

ESTs; Weakly similar to ZnT-3 [H.sapien 

ESTs 

ESTs 

ESTs; Weakly similar to cDNA EST yk40 
ESTs 

ESTs; Weakly similar to dJ88J8.1 [H.sapi 
ESTs; Weakly similar to PTERIN-4-ALP 
ESTs; Weakly similar to !!!! ALU SUBFA 
ESTs; Weakly similar to C17G10.1 [C.ele 
ESTs 

EST cluster (not in UniGene) with exon h 
GDNF family receptor alpha 1 
EST cluster (not in UniGene) with exon h 
chromobox homolog 4 (Drosophila Pc cla 
EST cluster (not in UniGene) with exon h 
ESTs; Weakly similar to protein-tyrosine 

236 
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10.2 

12.1 
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5 

4.1 

16.8 

4 

4.4 

7 

4.8 
13.6 
4 
8 

5.4 

9.3 

6.7 

10.2 

6.6 

7.4 

6.2 

19.5 

8.8 

12.4 
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5.4 

7.8 

9.1 

5.3 

8.3 

7 

14 

9.1 

7.4 

30.1 

11.9 

5.5 

11 

11 

8.8 

6 

4.7 
8 

4.2 

18 

20.7 

11.6 

9.2 
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NM_003613Hs.151407 


EST cluster (not in UniGene) with exon h 


15.1 




302236 


Al 128606 


Hs.6557 


zinc finger protein 161 


25.8 




302276 


NM_004448Hs.323910 


EST cluster (not in UniGene) with exon h 
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302290 


AL1 17607 


Hs.1 75563 


Homo sapiens mRNA; cDNA DKFZp564 


41.4 
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NM_004271Hs.184018 


EST cluster (not in UniGene) with exon h 
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Hs.1 90386 


KIAA0924 protein 
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AL1 17406 


Hs.200102 


Homo sapiens mRNA; cDNA DKF2p434 
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matrix rnetalloproteinase 24 (membrane-in 
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FRTq- Wpaklv similar tn H1in9 4 \C. p!ph 
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AW293005 Hs.70704 


ESTs 
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Hs.46821 


ESTsj Weakly similar to putative [H.sapie 
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histone deacetylase 3 
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Hs.84700 
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Hs.1 031 80 
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Hs.4055 


ESTs 


17.7 
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ESTs; Weakly similar to zinc finger prote 
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U.O 




OUUHJu 


AA738110 




FQT cinnlotnn (nr\\ in 1 Inif^ono^ viuith ovnn 
CO 1 oil lyiclUII \\nJi III ulllOcI \<o) WIUI caUI 1 


4 1 

*r. 1 




ouuoso 


M872838 




l/a rati n R 
KcldUil 0 
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EST singleton (not in UniGene) with exon 
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313590 M804410 Hs.291677 EST cluster (not in UniGene) 

313663 AI953261 Hs.169813 ESTs 

313667 U69201 Hs.13684 ESTs; Weakly similar to choline kinase is 

313749 AW450376 Hs.119004 ESTs 

5 313832 AW271022 Hs.1 33294 ESTs 

313881 AA535580 Hs.16331 ESTs 

313915 AI969390 Hs.1 63443 ESTs 

313955 AI858884 Hs.270647 ESTs 

313974 AI310151 Hs.173524 ESTs 

10 314097 M648744 Hs.269493 ESTs 

314129 M228366 Hs.115122 ESTs 

314359 AA205569 Hs.1 941 93 ESTs 

314384 AA535840 Hs.1 62203 ESTs; Weakly similar to alternatively spli 

314394 AI380563 Hs.130816 ESTs 

15 314462 AA347951 Hs.326413 ESTs 

314465 AA602917 Hs.156974 ESTs 

r "' 314470 AI934422 Hs.30661 ESTs 

p 314488 AA358265 Hs.182890 ESTs 

r; 314506 AA833655 Hs.206868 ESTs 

72,0 314510 AI204418 Hs.190080 ESTs 

"Vi 314558 AI873274 Hs.190721 ESTs 

W 314661 AA436432 Hs.324239 EST cluster (not in UniGene) 

p i 314691 AW207206 Hs.136319 ESTs 

L \ 314754 AW026761 Hs.134374 ESTs 

25 314775 AI149880 Hs.188809 ESTs 

O 314943 AI476797 Hs.184572 cell division cycle 2; G1 to S and G2 to M 

314961 AW008061 Hs.231994 ESTs 

314963 AI689617 Hs.200934 ESTs 

315006 AI538613 Hs.298241 ESTs 

L30 315010 AA531082 Hs.240049 ESTs 

ft; 315019 AA532807 Hs.105822 ESTs 

"Z 315033 AI493046 Hs.146133 ESTs 

s£i 315036 AA534953 Hs.1 63297 ESTs 

p'i 315037 AW205863 Hs!l33988 ESTs; Weakly similar to gene MAC25 pr 

35 315051 AW292425 Hs.163484 EST 

"= ! 315054 AI968598 Hs.78768 ESTs 

315073 AW452948 Hs.257631 ESTs 

315080 M744550 Hs.136345 ESTs 

315083 AI221325 Hs.205442 ESTs 

40 315088 AA557351 Hs.152448 ESTs; Moderately similar to MULTIFUN 

315175 AI025842 Hs.152530 ESTs 

315196 AA972756 Hs.44898 ESTs 

315296 AA876905 Hs.1 25286 ESTs 

3 1 5303 AW1 94364 Hs. 1 28022 ESTs; Weakly similar to FIG-1 PROTEIN 

45 315352 AA604799 Hs.136528 ESTs; Moderately similar to !!!! ALU SU 

315364 AA643602 Hs.1 55485 ESTs; Highly similar to serine protease [H 

315368 AW291563 Hs.104696 ESTs 

315390 AI801565 Hs.200113 ESTs; Weakly similar to alternatively spli 

315408 AW273261 Hs.216292 ESTs 

50 315458 AA872000 Hs.116104 ESTs 

315472 AA828850 Hs.1 65469 ESTs 

315478 AA665612 Hs.120874 ESTs 

315498 AA628539 Hs.1 16252 ESTs; Moderately similar to !!!! ALU SU 

315527 AI791138 Hs.116768 ESTs 

55 315530 AI200852 Hs.127780 ESTs 

315562 AA737415 Hs.152826 ESTs 

315634 AA837085 Hs.220585 ESTs 

315647 AA648983 Hs.212911 ESTs 

315652 AI521489 Hs.3053 ESTs 

60 315676 AW002565 Hs.124660 ESTs 

315680 AA814309 Hs.123583 ESTs 

315735 AI831760 Hs.155111 ESTs 

315741 AA812168 Hs.122559 ESTs 

315769 AA744875 Hs. 189413 ESTs 

65 315978 AA830893 Hs.1 19769 ESTs 

315984 AI015862 Hs.131793 ESTs 

316042 AW297979 Hs.170698 ESTs 
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321012 AA737314 Hs.194324 EST cluster (not in UniGene) 

321050 AW393497 EST cluster (not in UniGene) 

321051 AF134149 Hs.240395 EST cluster (not in UniGene) 
321171 AI769410 Hs.221461 ESTs 

321192 AA295304 Hs.297939 ESTs; Weakly similar to neogenin [H.sap 

321354 AA078493 EST cluster (not in UniGene) 

321387 H68014 Hs.141278 ESTs; Weakly similar to!!!! ALU SUBFA 

321412 AW366305 Hs.22891 EST cluster (not in UniGene) 

321 489 AW392474 Hs.1 72759 ESTs; Moderately similar to !!!! ALU SU 

321539 N98619 Hs.42915 ARP2 (actin-related protein 2; yeast) horn 

321593 H84762 Hs.253197 ESTs 

321666 D28390 Hs.272897 EST cluster (not in UniGene) 

321891 AW157424 Hs.165954 ESTs 

321910 H67065 Hs.271530 ESTs; Weakly similar to!!!! ALU SUBFA 

321953 AW068268 Hs.292833 ESTs; Weakly similar to !!!! ALU CLASS 

321978 N77342 Hs.21851 EST cluster (not in UniGene) 

322017 AA310039 Hs.9192 ESTs 

322026 AA233527 Hs.283675 low density lipoprotein receptor (familial 

322035 AL137517 Hs.306201 EST cluster (not in UniGene) 

322171 AF085968 Hs.48474 EST cluster (not in UniGene) 

322175 AF085975 EST cluster (not in UniGene) 

322236 AL134970 Hs.104222 follistatin-like 1 

322303 W07459 Hs.1 57601 EST cluster (not in UniGene) 

322735 AA086123 Hs.297856 EST cluster (not in UniGene) 

322777 AA679082 Hs.269947 ESTs 

322818 AW043782 Hs.293616 ESTs 

322882 AW248508 Hs.279727 DiGeorge syndrome critical region gene 2 

322975 C16391 EST cluster (not in UniGene) 

322991 C18965 Hs.159473 ESTs 

323011 AA580288 EST cluster (not in UniGene) 

323091 AW014094 Hs.210761 ESTs 

323107 AI301107 Hs.150790 ESTs 

323136 AL120351 Hs.30177 EST cluster (not in UniGene) 

323168 AL120862 Hs.124165 ESTs 

323195 AI064982 Hs.117950 multifunctional polypeptide similar to SA 

323201 AL049370 Hs.1 3350 Homo sapiens mRNA; cDNA DKFZp586 

323203 AA203135 Hs.1 301 86 ESTs 

323243 W44372 Hs.1 10771 EST cluster (not in UniGene) 

323244 T70731 Hs.193620 EST cluster (not in UniGene) 
323328 AA228078 Hs.255096 EST cluster (not in UniGene) 

323332 AI829520 Hs.227513 ESTs 

323333 AA228883 Hs.208558 EST cluster (not in UniGene) 

323570 AL038623 Hs.208752 ESTs; Weakly similar to !!!! ALU SUBFA 

323604 AI751438 Hs.41271 ESTs; Weakly similar to !!!! ALU SUBFA 

323685 AA344205 Hs.289088 EST cluster (not in UniGene) 

323753 AA327102 Hs.70266 EST cluster (not in UniGene) 

323817 AA410943 EST cluster (not in UniGene) 

323845 AI684674 Hs.41127 ESTs; Weakly similar to waclaw [D.melan 

323930 AA570698 Hs.8173 ESTs 

323997 AA844907 Hs.274454 EST cluster (not in UniGene) 

324047 AA378201 Hs.271340 EST cluster (not in UniGene) 

324261 AL044891 Hs.269350 EST cluster (not in UniGene) 

324302 AA543008 Hs.292471 ESTs; Weakly similar to!!!! ALU SUBFA 

324338 AL1 38357 Hs.1 45078 ESTs 

324344 AW502000 Hs.46677 EST cluster (not in UniGene) 

324432 AA464510 Hs.152812 EST cluster (not in UniGene) 

324495 AW501411 Hs.122489 ESTs; Weakly similar to !!!! ALU CLASS 

324497 AW152624 Hs.136340 ESTs 

324598 AA502659 Hs.1 63986 ESTs 

324603 AW016378 Hs.292934 ESTs 

324620 AA448021 Hs.94109 EST cluster (not in UniGene) 

324727 AI610425 Hs.1 9597 ESTs 

324774 AI031771 Hs.132586 ESTs 

324783 AA640770 Hs.200994 EST cluster (not in UniGene) 

324824 AI826999 Hs.224624 ESTs 

324826 AA704806 Hs.143842 ESTs 

324902 D31323 Hs.271492 ESTs 
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AA613792 

T06882 Hs. 172634 

T06997 Hs.121028 

AI064690 Hs.171176 



M23263 
AA449749 
H14624 
X03363 

D50692 Hs.78221 
TIGR:HT544 
M13755 Hs.833 
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M34423 
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M81057 
U22970 
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U32989 
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D57823 
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androgen receptor (dihydrotestosterone re 

ESTs; Highly similar to secreted apoptosi 

ESTs; Highly similar to secreted apoptosi 

HER2 receptor tyrosine kinase (c-erbB-2; 

c-myc binding protein 

Hs.73946 

interferon-stimulated protein; 15 kDa 
interieukin 7 receptor 
galactosidase; beta 1 
FK506-binding protein 2 (1 3kD) 
carboxypeptidase B1 (tissue) 
multiple UniGene matches 
cytochrome P450; 51 (lanosterol 14-alpha 
tryptophan 2;3-dioxygenase 
hepatocyte nuclear factor 3; alpha 
transporter protein 
cystatin E/M 

Sec23 (S. cerevisiae) homolog A 
mannosyl (alpha-1;3-)-glycoprotein beta-1 
ESTs; Weakly similar to transformation-r 
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ESTs 
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Homo sapiens mRNA; cDNA DKFZp434 
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anterior gradient 2 (Xenepus laevis) homo 
ESTs; Weakly similar to cDNA EST yk47 
ESTs 
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TABLE 13A 



Table 13 A shows the accession numbers for those pkeys lacking unigeneED's for Table 13. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

123619 371681J M602964 AA609200 

103207 30635_4 X72790 

103349 11052_-2 X89059 

110856 19346J4 AA992380 N33063 N21418 H79958 R21911 H79957 

113248 328626J T63857 AW971220 M493469T63699 

123169 44573_2 AI950087 N70208 R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 



AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL1 34043 AW235363 AA663345 AW008282 AA488964 
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AI653832 
AI762688 M988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961 101 AA251669 M251874 
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 M78041 9 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 



102919 25180_2 M21191 AL035748 AA021266 AA323126 AA180515 AI613029 D28356 NM_000034 M11560 AW401425 AW248248 AA012956 



AA323294 W04965 H38759 AA206622 M580747 AI541 172 M381075 AA354229 AW402353 AW405575 AW404021 AW406207 
AA075752 AW176066 AA287222 AA195818 T20243 R87945 AA295539 AA402533 AA232419 AA224515 AW401583 AA331367 
AW402140 AW249079 R31488 AA075757 X05236 AW239490 AA338036 AW239495 AA357262 AA431 005 AA306726 R33804 
AA216544 AW275288 AA227044 AL038124 AA243300 C03242 AA315615 AL035840 R64336 M313917 M018963 M001385 
AA054395 H30840 AW498825 AA086141 AI557324 AA121576 H39128 R77161 AA019688 AA380987 AA348140 AA348257 
AW176086 AA362432 AA171389 AA362416 AA299938 AA319093 AA337972 C04921 AA345696 R89640 AA085425 AA481708 
M313637 AL039229 H84490 H86153 F00656 M326668 M347304 R65890 H41949 M339309 AW402002 AW404854 
AA1 92582 M1 12802 H09248 N83165 H38367 AA356339 M455763 R66853 AA294935 H85911 AA310414 H93436 N87014 
AA001 186 H83640 AA41 1328 AA317929 C04192 AW406288 U46335 M323179 AA427649 AI366131 H14328 AA197161 
AA379497 M31 1816 M017206 AA001 137 AA017420 AA012990 AW163775 AA021397 AA295513 AA355248 AA374921 
AW380419 AA345864 AA318058 M371711 AA363255 AA057094 R88057 AA394045 AW362741 M479579 AW362789 
AW362775 AA223624 AW362809 AW362746 AW362753 AW380412 AA057104 AA295271 X06352 AW362771 X12447 R59553 
R87925 AA292864 R83920 F01120 AA496740 M852551 AL039209 AA093339 AA333961 N86416 C03325 C03831 AA1 94750 
AA359270 AA359460 AW245492 M339400 AA330784 AI908970 N89223AW361938 AI940063 AA134323 M371741 F01267 
AW372970 AA341973 AA346098 AW372969 AA337549 AA327342 H93855 AI074079 F291 18 AA852940 F35696 AA345963 
AA079578 M1 13785 C02989 AA095945 C03145 C05199 AA346024 M190506 AW361659 AI909845 AW374374 AW374382 
AW374401 AW374373 AW374370 A1909831 AW374367 AA353658 F01041 C02843 M375948 AW374414 AA213946 AI525039 
H13744 R31007 M112044 AA134404 H47935 AW177018 AA429768 AA336873 AA112875 H46393 M191267 D59131 
AW406037 AA055244 AA341880 AA179024 AA308537 AW406985 AW32731 1 H30301 AA300705 H43788 AW364149 
AA806213 M481936 C04941 AW375299 M21 190 M410818 AA250940 AI354547 M31 7422 M250903 AI865497 AA890603 
AA366197 AW498538 M78072 AW406461 C03092 F00308 H56488 AA336320 AW406501 AA354102 AA382942 AA096393 
AW376830 AA383446 F01259 AW081388 T94036 M379643 H43842 AI524063 AA621727 AA379099 AA371417 R66811 
H44129 N84794 F01135 AA477852 AA293062 AW361595 H27194 AI831650 H43253 H24797 AI564680 AA380090 W20057 
AI921586 AI192549 AW090808 H25967 AA918121 AI626060 H20221 AA812572 H42178 AA887222 H96000 C03180 F00946 



123533 genbank_M608751 

116480 genbank_C14088 

132225 genbank_AA1 28980 

125154 genbank_W38419 

118475 genbank_N66845 



AA608751 
C14088 
M1 28980 
W38419 
N66845 
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C03986 AI318091 AI860172 M582179 AI633388 M557193 R68075 F24105 AW518239 W56622 AI625219 AI925243 
AW468046 AI921828 AA339164 AI144391 AA643334 AA459631 AA873247 AW373432 AA604384 H27600 AI680458 M159956 
AA6 10836 AA364298 AW373435 AA604225 W73754 AW087924 AA599776 N89227 AI630871 AI633128 AW514329 M010455 
AA563928 AI571596 AI128394 W73707 AI423575 AA583809 AA657988 AI950837 AA169782 M600009 AI885540 M771884 
AI978829 AA505408 AA533937 M481469 AA610869 AA775241 AW273870 AW070909 AI905695 AA4801 15 AA574051 
AI889185 AA773167 AA331375 AA001437 AA194324 AA194300 M558632 AI038538 AA41 1329 M781570 AI833176 
AA935520 AW074197 AA583063 AW073099 AW001 198 M948025 AA587857 AA191540 AI460085 AA193244 AI538037 
AA515572 AA758587 AI14931 1 M508610 AA206409 AA534004 M994600 AA827543 AI916349 AW245129 AW517804 D25663 
AA781985 AA284536 AI819422 H16040 H27531 AA456564 AA845555 AI423596 AA012908 AA889439 AA716311 M968868 
AA320508 AA725731 AA834202 M935997 AA72481 5 AA769353 AA594803 F00827 AI342442 AI003519 AI002503 AI347597 
AI040946 AA1971 62 AA987883 AA292865 AW001 944 AI64071 1 AW244044 AA456784 F30588 AA290829 H24754 AI978683 
AA483686 AA583939 AA121382 AA833831 AA477102 AA977322 AA666379 F35456 AA993537 AI749610 M226934 AA716204 
AW513025 AA628543 AA583705 F25702 AI368748 AI124097 AI880086 M477513 AI758834 AI690753 AA477746 F37761 
AA642243 M159957 AA250844 AA459406 AA427566 F25054 AI569314 AA961665 AI922050 AI759000 AA555236 AA514432 
AA293474 AA001 129 M826789 AA641390 AA134405 F35585 M477416 AW193359 AI361315 AA284988 F36340 AI361322 
F26969 AA991922 AA021267 F26973 AI361314 F35891 AI918509 AA250964 AA190992 AA577139 M865535 AA134324 
AW192842 AI224046 F18975 AA779626 AA856894 AW269997 AW014614 H95554 F31378 AA374868 F26343 AA654007 
AI830942 AA1 13195 F26432 W56652 AA464690 AA055263 AA340654 AA031448 AA976399 AA972526 AA063476 R83921 
T16240 AA533290 N91545 H44053 AA883451 AA513761 AA086477 H09249 F20482 F26737 AA054148 AA857063 AA017259 
M179789 AA088908 H43704 AA194320 F35950 AI880127 F20441 F32878 AA962483 H39094 H56489 H44621 F19390 
AI302232 F25162 AA826965 AA086052 AA917410 AA454513 R59554 AA196755 AA086369 AA079530 H28106 AA243301 
AI025737 AA101239 AA088887 AI214910 AA974886 F16089 F26054 AA515092 F33436 F32829 M78061 AA235645 F19715 
F37529 AI81 1549 AA665180 M708200 F01124 F32382 AA346220 AA627361 F30741 F30010 F28543 AA211715 F20245 
M331222 F25634 F21996 W28215 F21911 R65793A1 192566 H20130 H84491 M719223 M557435 F16967 F26989 F30353 
AA857159 AA291918 F28234 F20840 F25176 F22437 T27904 AA480355 F19528 R87926 H14286 F27532 AW337864 F28411 
H13692 F25651 M975454 F24229 F29657 F18024 AA464779 F17588 F34954 AA947328 F18063 AA657777 AA459644 N91455 
F27850 F29608 F27206 F18418 F31459 F18564 F33496 F16376 F29740 F29843 F29904 F29866 F19135 AW276602 T40337 
F24835 F34672 F26474 AI926215 AA464185 F18217 N26193 AL043256 T41197 F33055 F00386 F29500 F34191 F33297 
AI937207 F22724 F15909 F26232 F1 8889 AA31 8627 F29085 M872104 F17509 F23373 F15660 F17552 F17412 F16863 
F34033 F21515 F17364 F18383 F16546 F17561 F17260 AA292000 F15723 T47438 F16798 F18046 F18319 F17978 F17566 
F34230 F33258 F20860 F17998 AI695701 

118600 genbank_N69222 N69222 

118952 genbank_N92966 N92966 

120873 genbank_AA358015 AA358015 

113702 genbank T97307 T97307 

129982 221_267 Z14221 AW381862M97920AW401444 Z66542 M29470 AW406502 X61 01 1 M34024 AA327072Z14166 Z14167 Z14165 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
AF062177 AF062222 AF0621 15 Z47240 AF062263 AF062261 AF062223 AF06221 1 Z47238 AW401714 AW404008 AW404991 
AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z1 1946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 X81748 AF174019 AF062285 L06924 L1 1699 AW402665 L09070 L28049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 AI268604 Y15773 X64239 X62969 U00506 
X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 
D83676 AB021529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF1 15130 AF115117 Z92896 U21254 Z92895 AF115112 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U00510 U58144 AF004323 AF1 15109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 
Z96957 AF021986 Z92898 Z96956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 
AF02201 1 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 AF103184 L04343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF1031 15 
AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF068670 AF068671 AF068664 
AF068669 AF068666 AF068665 AF068672 AW371 244 AW403670 AW408074 AW404575 AW3621 53 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NM_002179 AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW406586 M482084 AI872299 AA715266 AW404328 AI831 674 AI709348 M6031 12 
AW514864 M485775 H64492 AW404789 AA487630 AA715498 AA295885 T27613 T98113 

115763 genbank_AA421560 AA421560 

124357 genbank_N22401 N22401 

108733 504187J AA121022 AA126422 

101544 entrez_M31169 M31169 

124447 genbank_N48000 N48000 

124677 genbank_R01073 R01073 

124777 genbank_R41933 R41933 

119302 genbank_T25725 T25725 
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336676 




338008 


w 


io one? 

338057 


9 


329655 




336959 




o2bo99 


la a 


329960 
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OOo4Dl 




OOOCQQ 

OOODOO 




338665 




338689 




oUoof / 




oooooz 




oooyoU 




ooouuy 


40 


0*5*3 AH A 

333010 




OOOUl 0 




oUeyol 








OOO 4 AQ 

333108 


40 


333139 




333254 




333305 




333343 




333388 


50 


326213 




111 JPA 

333456 




'ion J r-/\ 

333459 




333517 


55 


333585 


333679 




326474 




333743 




333758 




333767 


60 


333768 




333769 




333795 




333796 


65 


335004 


333892 




335115 




333904 



entrez_Z93784Z93784 

H58818_at H58818 

genbank_R51818 R51818 

CAT number Accession 

46877J AF085975 H53458 H53459 

139750J AA580288M31 5655 M1 33031 AA377748 

1510563J C16391 C16413 

211994J AW881145 AA490718 M85637M304575T06067AA331991 
233566J AA410943 AW948953 AA334202 AA332882 
1046029_-2 AW170035 

376239J M613792 AW182329 T05304 AW858385 

325188J BE174240 M488528AL042253 

447553_1 BE151746 BE336853 D63271 T94955 AA774994 

174129J AA216387 T63548AA228676 

502195J C05928AW393497 

345248 1 AA534222 M632632 T81234 

4380.7 AW962299 AA310349 AW962294 H63791 H63751 

116028 -2 AA078493 

CH22_3941 FG_834_7_LINK_DJ 

CH22_3992FG_842_3_LINK_DJ 

CH22_3994FG_842_5_LINK_DJ 

c_x_hs 

c_x_hs 

CH22_4154FG_43_4_ 
CH22_6490FG_LINK_EM:AC00 
CH22_6558FG_LINK_EM:AC00 
c14_p2 

CH22_4764FG_367_13_ 

c15_p2 

c16_p2 

CH22_7067FG_LINK_EM:AC00 
CH22_7124FG_LINK_EM:AC00 
CH22_7331 FG_LINK_EM:AC0O 
CH22_7438FG_LINK_EM:AC00 
CH22_7464FG_LINK_EM:AC00 
AI761173 

CH22_7678FG_LINK_DJ246D7 

CH22_7859FG_LINK_DA59H18 

CH22_233FG_61_1_LINK_EM:A 

CH22_234FG_61_2_LINK_EM:A 

CH22_237FG_61_5_LINK_EM:A 

AI873242 

AI880172 

CH22_336FG_79_14_LINK_EM: 
CH22_368FG_83_16_LINK_EM: 
CH22_495FG_1 18_2_LINK_EM 
CH22_550FG_137_2_LINK_EM; 
CH22_589FG_139_12_LINK_EM 
CH22_634FG_144_3_LINK_EM: 
c17_hs 

CH22_706FG_157_5_LINK_EM 
CH22_709FG_157_8_LINK_EM: 
CH22_773FG_1 73_2_LINK_EM: 
CH22_846FG_203_4_LINK_EM: 
CH22_941FG_247_6_LINK_EM: 
c19 hs 

CH22_1009FG_264J_UNK_EM 
CH22_1024FG_268_1_L1NK_EM 
CH22_1034FG_271_6_LINK_EM 
CH22_1 035FG_271_7_LINK_EM 
CH22_1036FG_271_8_L1NK_EM 
CH22_1063FG_275_1_LINK_EM 
CH22_1065FG_275_3_LINK_EM 
CH22_2326FG_472_8_LINK_EM 
CH22_1 163FG_292_14_LINK_E 
CH22_2447FG_496_2_LINK_EM 
CH22_1 176FG_294_2_LINK_EM 
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333905 CH22_1177FG_294_3_LINK_EM 

333921 CH22_1194FG_296_12_LINK_E 

333968 CH22_1245FG_307_4_LINK_EM 

328059 c_6_hs 

5 335287 CH22_2629FG_526_11_LINK_E 

326816 c20_hs 

326817 c20_hs 

335342 CH22_2689FG_536 1_LINK_EM 

335491 CH22_2843FG_570_23_LINK_E 

10 335495 CH22_2847FG_570_28_LINK_E 

335498 CH22_2850FG_571_7_LINK_EM 

328304 c_7_hs 

305453 AA738110 

335544 CH22_2899FG_576_5_LINK_EM 

15 335610 CH22_2969FG_583_4_LINK_EM 

y. 335653 CH22_3013FG_590_4_LINK_EM 

L 335682 CH22_3043FG_595_2_LlNK_EM 

P 335687 CH22_3048FG_596_2_LINK_EM 

Ci 328492 c_7_hs 

="20 335755 CH22_3122FG_604_4_LINK_EM 

335782 CH22_3151FG_609_4_LINK_EM 

W 335791 CH22_3160FG_611_7_LINK_EM 

f1; ; 335809 CH22_3181FG_617j6_LINK_EM 

I "■ 335822 CH22 3195FG 619 7 LINK.EM 

'25 335823 CH22_3196FG_619_8_LINK_EM 

W 335824 CH22_3197FG_619_11_LINK_E 

335825 CH22_3198FG_619_12_LINK_E 

r .-.. 335895 CH22_3272FG_635_3_LINK_EM 

h* ! 335917 CH22_3294FG_636_13_LINK_E 

130 335920 CH22_3297FG_636_16_LINK_E 

??; 305898 AA872838 

'Z 305913 AA876109 

=N 305950 AA884479 

P : l 328857 c_7_hs 

=35 330084 c19_p2 

= = : 337968 CH22_6419FG_LINK_EM:AC00 

309177 AI951118 

309198 A1955915 

309226 AI969897 

40 339352 CH22_8317FG_LINK_BA354I1 

309279 AI990102 

339373 CH22_8348FG_LINK_BA232E1 

325622 c14_hs 

334102 CH22_1380FG_327_60_LINK_E 

45 332927 CH22_148FG_38_1_LINK_C20H 

332929 CH22 150FG_38_3_LINK_C20H 

332930 CH22_151FG_38_4_LINK_C20H 
332955 CH22_179FG_48_12_LINK_EM: 
332958 CH22_182FG_48_15_LINK_EM: 

50 332961 CH22_185FG_48_18_LINK_EM: 

332983 CH22_207FG_54_5_LINK_EMA 

334222 CH22_1506FG_360_3_LINK_EM 

334223 CH22_1507FG_360_4_LINK_EM 
334264 CH22_1551FG_367_15_LINK_E 

55 327110 c21_hs 

334343 CH22_1636FG_375_25_LINK_E 

334360 CH22 1654FG_378_5_LINK_EM 

327196 c_1_hs 

327283 c_1_hs 

60 327313 c_1_hs 

304465 AA421948 

304507 AA456426 

327450 c_2_hs 

304591 M505702 

65 304601 AA507875 

304659 AA533185 

334784 CH22_2096FG_432_9_LINK_EM 
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334789 CH22_2101FG_432_14_LINK_E 

334794 CH22 2106FG 434 2 LINK EM 

336035 CH22_3420FG_678_6_LINK_DJ 

336042 CH22_3427FG_679_4_LINK_DJ 

336093 CH22_3481FG_691_2_LINK_DJ 

336096 CH22_3484FG_691_5_LINK_DJ 

334889 CH22_2206FG_452_3_LINK_EM 

336150 CH22_3540FG_706_6_LINK_DA 

336152 CH22_3543FG_706_9_LINK_DA 

336416 CH22_3833FG_823_38_LINK_B 

336444 CH22_3864FG_827_10_LINK_D 

336449 CH22_3870FG_829_6_UNK_DJ 

336471 CH22_3894FG_829_30_LINK_D 
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TABLE 13B 



Table 13B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 13B. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

Ntjposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 




Strand 


Nt_position 


332955 


Dunham, 


. etal. 


Plus 


2508896-2508992 


332958 


Dunham, 


. et.al. 


Plus 


2516164-2516310 


332961 


Dunham, 


. etal. 


Plus 


2521424-2521555 


333139 


Dunham, 


. etal. 


Plus 


3369495-3369571 


333254 


Dunham, 


. etal. 


Plus 


2521424-2521555 


333305 


Dunham, 


. etal. 


Plus 


4630388-4630645 


333388 


Dunham, 


. etal. 


Plus 


49137494913805 


333517 


Dunham, 


. etal. 


Plus 


5570729-5570925 


333585 


Dunham, 


. etal. 


Plus 


6234778-6234894 


333679 


Dunham, 


. etal. 


Plus 


7068795-7068896 


333767 


Dunham, 


. etal. 


Plus 


7694407-7694623 


333768 


Dunham, 


. etal. 


Plus 


7695440-7695697 


333769 


Dunham, 


. etal. 


Plus 


7696625-7696707 


333795 


Dunham, 


. etal. 


Plus 


7807688-7807795 


333796 


Dunham, 


. etal. 


Plus 


7808253-7808319 


333892 


Dunham, 


. etal. 


Plus 


8156825-8157001 


333921 


Dunham, 


. etal. 


Plus 


8380325-8380441 


333968 


Dunham, 


. etal. 


Plus 


8681004-8681241 


334102 


Dunham, 


. etal. 


Plus 


9995140-9996373 


334264 


Dunham, 


. etal. 


Plus 


13234447-13234544 


334343 


Dunham, 


. etal. 


Plus 


13655828-13656307 


334794 


Dunham, 


. etal. 


Plus 


16374312-16374458 


334889 


Dunham, 


. etal. 


Plus 


19286024-19286515 


335287 


Dunham, 


. etal. 


Plus 


22299047-22299299 


335491 


Dunham, 


. etal. 


Plus 


24128651-24128827 


335495 


Dunham, 


. etal. 


Plus 


24140688-24140872 


335498 


Dunham, 
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TABLE 14: Table 2 from BRCA 001-5 US 

Table 14, a subset of table 13, depicts a preferred group of genes highly upregulated in breast 
cancer cells. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

1 0 UnkjenelD: Unkjene number 

Unkjene Title: Unkjene gene title 

R1: Ratio of tumor to normal breast tissue 
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18.3 
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heat shock 70 kD protein 4 


17.4 
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Hs. 156346 


Homo sapiens (cell line HL-6) alpha topois 
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M13755 


Hs.833 


interferon-stimulated protein; 15 kDa 
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M86849 
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Homo sapiens connexin 26 (GJB2) mRNA 


22.5 
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extracellular matrix protein 1 
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Hs.26102 
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16.6 
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19.3 
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19.5 
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Hs.42645 


ESTs 


18.1 
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20.6 


125852 
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Hs.76550 


Homo sapiens mRNA; cDNA DKFZp564B 


25.9 
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Hs.265398 


ESTs; Weakly similar to transformation-rel 


16.4 
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91 ? 




318744 


AI793124 
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DiGeorge syndrome critical region gene 2 


ID.O 
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324432 


AA464510 
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ESTs 
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55 


330388 
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HER2 receptor tyrosine kinase (c-erbB-2; E 


17.7 
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interferon-stimulated protein; 15 kDa 
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330814 
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Hs.265398 
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dachshund (Drosophila) homolog 


15.1 
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Hs.1892 


EST; Highly similar to PHENYLETHANO 


15.3 
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17.8 


65 


333769 






CH22 FGENES.271 8 


48.3 


333968 






CH22 FGENES.307 4 


15.9 
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18.5 
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335791 CH22_FGENES.611_7 27.3 

336512 CH22_FGENES.834_7 21.4 

338008 CH22_EM:AC005500.GENSCAN.127-9 15.2 
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TABLE 14A 

Table 14A shows the accession numbers for those pkeys lacking unigenelD's for Table 14. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and rnRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

309583 1046029_-2 AW170035 

336512 CH22_3941FG_834_7_LINK_DJ 

338008 CH22_6490FG_LINK_EM:AC00 

333769 CH22_1036FG_271_8_UNK_EM 

333968 CH22_1245FG_307_4_LINK_EM 

335791 CH22_3160FG_611_7_LINK_EM 

309177 AI951118 

332958 CH22_182FG_48_15_LINK_EM: 

334223 CH22_1507FG_360_4_LINK_EM 

334264 CH22_1551FG_367_15_LINK_E 

123619 371681 1 AA602964 AA609200 
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TABLE 14B 



Table 14B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 14. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1 999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

Nt_position: Indicates nucleotide positions of predicted exons. 



Pkey Ref 

332958 Dunham, I. et.al. 

333769 Dunham, I. etal. 

333968 Dunham, I. etal. 

334264 Dunham, I. etal. 

338008 Dunham, I. etal. 

334223 Dunham, I. etal. 

335791 Dunham, I. etal. 

336512 Dunham, I. etal. 



Strand Nt_position 

Plus 2516164-2516310 

Pius 7696625-7696707 

Plus 8681004-8681241 

Plus 13234447-13234544 

Plus 7697068-7697236 

Minus 12734365-12734269 

Minus 25948563-25948411 

Minus 34278373-34278275 
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TABLE 15: Table 3 from BRCA 001-5 US 



Table 15 shows genes downregulated in breast cancer cells. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnfcjenelD: Unkjene number 

Unkjene Title: Unkjene gene title 

R1: Ratio of normal breast tissue to tumor 



Pkey ExAccn UniGene IDUnigene Title R1 

100115 D00632 Hs.172153 glutathione peroxidase 3 (plasma) 1.7 

100499 TIGR:HT1428 Hs.283108 Globin, Beta 1.5 

100502 TIGR:HT1496 Hs.169228 Adrenal-Specific Protein Pg2 2.3 

100815 TIGR:HT4268 Hs.9739 L-Glycerol-3-Phosphate:Nad+Oxidoreduct 1.7 

101125 L10373 Hs.82749 transmembrane 4 superfamily member 2 1.5 

101367 M1 2963 Hs.4 alcohol dehydrogenase 1 (class I); alpha po 2.9 

101397 M15856 Hs.180878 lipoprotein lipase 1.6 

101883 M98399 Hs.75613 CD36 antigen (collagen type I receptor; thr 1.6 

102227 U25138 Hs.93841 potassium large conductance calcium-activ 1.6 

102857 X00129 Hs.76461 retinol-binding protein 4; interstitial 3 

103211 X73079 Hs.288579 polymeric immunoglobulin receptor 1.8 

103496 Y09267 Hs.132821 flavin containing monooxygenase 2 1.5 

103562 Z21966 Hs.2815 POU domain; class 6; transcription factor 1 1.8 

104672 AA007629 glycerol-3-phosphate dehydrogenase 1 (sol 2.4 

105083 AA146619 Hs.18791 ESTs; Weakly similar to CALCIUM-BIND 1.7 

105138 AA164519 Hs.15248 ESTs 1.5 

106075 AA417915 Hs.25930 ESTs 1.5 

106870 AA487576 Hs.26530 serum deprivation response (phosphatidyls 1.6 

107099 AA609645 Hs.211568 eukaryotic translation initiation factor4 gam 2.7 

107616 AA004901 Hs.261164 ESTs 1.6 

107997 AA037388 Hs.82223 Human DNA sequence from clone 141H5o 1.7 

108604 AA099820 Hs.49696 ESTs 2.4 

111130 N64265 Hs.19515 yz44h12.s1 Morton Fetal Cochlea Homo sa 1.7 

111837 R36447 Hs.24453 ESTs 1.6 

112538 R70255 ESTs 1.9 

112808 R97970 Hs.281022 EST 1.5 

113086 T40652 Hs.209100 DKFZP434C171 protein 1.9 

115740 AA418033 Hs.283559 ESTs 1.6 

115949 AA443800 Hs.43125 ESTs 2 

115965 AA446661 Hs.173233 ESTs 2.2 

117224 N20300 Hs.218707 ESTs 1.7 

117513 N32174 Hs.44317 SRY (sex-determining region Y)-box 10 1.7 

119059 R15436 Hs.77889 Friedreich ataxia region gene X123 1.7 

119175 R71792 Hs.301002 ESTs; Weakly similar to cell death activate 2.8 

119359 T71021 Hs.285681 ESTs; Highly similar to WS basic-helix-loo 1.9 

119798 W73386 Hs.249129 ESTs 3 

120889 AA365784 Hs.97044 ESTs 1.6 

121381 AA405747 Hs.97984 ESTs; Weakly similar to WASP-family pro 1.8 

121750 AA421184 Hs.97549 ESTs 1.5 

122127 AA434447 Hs.106771 ESTs 2.5 

122348 AA443695 Hs.293410 ESTs 2.1 

122485 AA448300 Hs.160318 phospholemman 1.5 

123443 AA598841 Hs.167382 natriuretic peptide receptor A/guanylatecy 1.8 

123505 AA600135 ESTs; Moderately similarto !!!! ALU SUB 1.5 

125284 W94688 Hs.103253 perilipin 1.7 

126300 D81972 HUM427D08B Human fetal brain (TFujiw 1.8 

126747 R72515 Hs.160318 phospholemman 1.6 

127218 AA309765 Hs.116017 ESTs; Weakly similar to KIAA0795 protei 1.5 

127357 AA452788 Hs.75432 zx39g11.M Soares_total_fetus_Nb2HF8_9 1.7 
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ESTs 1.5 

ESTs; Weakly similar to !!!! ALU SUBFA 1.5 

ESTs 1.5 

ESTs 1.6 

eukaryotic translation elongation factor 2 1 .7 

DKFZP586P1 422 protein 1.5 

ESTs 2.1 

ESTs; Highly similar to CGI-38 protein [H 1 .5 

insulin-like growth factor binding protein 6 1 .7 

hemoglobin; beta 1.7 

myomesin 1 (skelemin) (185kD) 3.8 

ESTs 1.9 

alcohol dehydrogenase 3 (class I); gamma p 2.2 

aquaporin7 1.7 

leptin (murine obesity homolog) 2.5 

ESTs; Weakly similar to Homo sapiens p2 1 .6 

deleted in bladder cancer chromosome regi 1.5 

tetranectin (plasminogen-binding protein) 2 

gamma-aminobutyric acid (GABA) A recep 1 .5 

integrin; alpha 7 1.7 

transferrin 2.3 

glutathione S-transferase M5 1 .9 

TU3A protein 4.6 

deoxyribonuclease l-like 3 1 .5 

carbonic anhydrase IV 1 .6 

Human G0S2 protein gene; complete cds 1 .9 

ESTs 1.7 

ESTs 1.5 

ESTs 1.8 

ESTs 1.6 

ESTs 2.1 

hemoglobin; alpha 1 1.8 

hemoglobin; delta 1.6 

EST cluster (not in UniGene) with exon hit 1 .7 

glycogenin2 1.5 

EST singleton (notin UniGene) with exon 1.5 

EST singleton (not in UniGene) with exon 1.5 

EST singleton (not in UniGene) with exon 1 .7 

EST singleton (not in UniGene) with exon 1 .5 

EST singleton (not in UniGene) with exon 1 .5 

EST singleton (not in UniGene) with exon 1 .6 

EST singleton (not in UniGene) with exon 1 .5 

EST singleton (not in UniGene) with exon 1 .9 

retinoid X receptor, beta 1 .5 

EST 1.5 

ESTs; Moderately similar to alternatively s 1 .8 

ESTs 1.6 

ESTs 2.1 

ESTs 1.9 

ESTs 2.3 

ESTs 1.8 

ESTs 2.2 

ESTs 1.9 

ESTs 1.7 

ESTs 1.8 

ESTs 1.5 

ESTs 1.6 

ESTs 1.7 

ESTs 1.5 

ESTs 1.5 

ESTs 1.6 

ESTs 1.5 

ESTs 1.7 

EST cluster (not in UniGene) 1.5 

ESTs 1.7 

EST cluster (not in UniGene) 1 .5 

ESTs 2.2 
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TABLE 15A 



Table 15A shows the accession numbers for those pkeys lacking unigenelD's for Table 15. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

126300 250375_2 D81972 BE003132 
112538 504579J AA908813 R70255 
123505 genbank_AA600135 AA600135 

104672 6735_7 AA349096 AI368018 F21390 F17759 R48772 AI421485 AI300352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263 
AI276281 R48205 AI245302 AI190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375 
H45809 F33447 AA774528 AA007629 H42537 C01077 F32386 

322102 46708J H45589H19807AF075038H19808H42437 

336865 CH22_4590FG_305_1_ 

338192 CH22_6755FG_LINK_EM:AC00 

329733 c14_p2 

326120 c17_hs 

326139 c17_hs 

326855 c20_hs 

335352 CH22_2699FG_539_5_LINK_EM 

335639 CH22_2999FG_584_19_LINK_E 

307206 Al 192534 

307377 AI222691 

337494 CH22_5727FG_799_12_ 

337764 CH22_6115FG_LINK_EM:AC00 

337983 CH22_6438FG_LINK_EM:AC00 

339366 CH22_8336FG_LINK_BA354I1 

325272 c11_hs 

325558 c12_hs 

325656 c14_hs 

334175 CH22_1455FG_349_10_LINK_E 
304182 H91086 

334347 CH22_1640FG_375_31_LINK_E 
327438 c_2_hs 
304622 AA516384 

334737 CH22_2049FG_424_12_LINK_E 
304682 AA550994 

336244 CH22_3642FG_746_2_LINK_DA 
306193 AA923457 

336336 CH22_3746FG_814_8_LINK_BA 
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TABLE 15B 



Table 15B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 15. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


334347 


Dunham, I. etal. 


Plus 


13663814-13663926 


334737 


Dunham, I. etal. 


Plus 


15998517-15998685 


335639 


Dunham, I. etal. 


Plus 


25173591-25173696 


337494 


Dunham, I. etal. 


Plus 


33339024-33339148 


334175 


Dunham, I. etal. 


Minus 


11668659-11668597 


335352 


Dunham, I. etal. 


Minus 


22681512-22681384 


336244 


Dunham, I. etal. 


Minus 


31402729-31402583 


336336 


Dunham, I. et.al. 


Minus 


33797209-33797076 


336865 


Dunham, I. etal. 


Minus 


8622405-8622289 


337764 


Dunham, I. etal. 


Minus 


40356404035446 


337983 


Dunham, I. etal. 


Minus 


7275495-7275271 


338192 


Dunham, I. et.al. 


Minus 


13248453-13248277 


339366 


Dunham, I. etal. 


Minus 


33647431-33647293 


325272 


5866902 


Minus 


13247-13312 


325558 


6056302 


Plus 


70930-71030 


325656 


6056305 


Minus 


78190-78707 


329733 


6065783 


Plus 


163237-163450 


326120 


5867194 


Plus 


36116-36276 


326139 


5867203 


Minus 


218901-218960 


326855 


6552460 


Minus 


111390-111463 


327438 


6004454 


Minus 


199569-199692 
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TABLE 16: Table 4 from BRCA 001-5 US 



Table 16, a subset of table 15, depicts a preferred group of genes highly downregulated in 
breast cancer cells. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnkjenelD: Unkjene number 

Unkjene Title: - Unkjene gene title 

R1 : Ratio of normal breast tissue to tumor 



Pkey 


ExAccn 


UniGene ID 


Unigene Title 


R1 


100502 


TIGR:HT1496 


Hs.169228 


Adrenal-Specific Protein Pg2 


2.3 


101367 


M12963 


Hs.4 


alcohol dehydrogenase 1 (class I); alpha 


2.9 


102857 


X00129 


Hs.76461 


retinol-binding protein 4; interstitial 


3 


104672 


M007629 




glycerol-3-phosphate dehydrogenase 1 


2.4 


107099 


AA609645 


Hs.211568 


eukaryotic translation initiation factor 4 gam 


2.7 


108604 


M099820 


Hs.49696 


ESTs 


2.4 


115949 


M443800 


Hs.43125 


ESTs 


2 


115965 


AA446661 


Hs.173233 


ESTs 


2.2 


119175 


R71792 


Hs.301002 


ESTs; Weakly similar to cell death activator 


2.8 


119798 


W73386 


Hs.249129 


ESTs 


3 


122127 


AA434447 


Hs.1 06771 


ESTs 


2.5 


122348 


AA443695 


Hs.293410 


ESTs 


2.1 


129285 


T62068 


Hs.11006 


ESTs 


2.1 


131267 


AA211776 


Hs.2504 


myomesin 1 (skelemin)(185kD) 


3.8 


131282 


M12272 


Hs.4 


alcohol dehydrogenase 3 (class I); gamma 


2.2 


131810 


D49487 


Hs. 194236 


leptjn (murine obesity homolog) 


2.5 


133120 


X64559 


Hs.65424 


tetranectin (plasminogen-binding protein) 


2 


133601 


S95936 


Hs.284176 


transferrin 


2.3 


134111 


N79674 


Hs.8022 


TU3A protein 


4.6 


301396 


AA923549 


Hs.224121 


ESTs 


2.1 


311794 


AW238092 


Hs.254759 


ESTs 


2.1 


312575 


H25237 


Hs.306814 


ESTs 


2.3 


313283 


W32480 


Hs.157099 


ESTs 


2.2 


322814 


AI824495 


Hs.211038 


ESTs 


2.2 


322929 


AI365585 


Hs.146246 


ESTs 


2.3 


324675 


AW014734 


Hs.157969 


ESTs 


2.2 


330931 


F01443 


Hs.284256 


ESTs 


4.6 


332364 


W94688 


Hs.103253 


perilipin 


2.1 


337983 






CH22_EM:AC005500.GENSCAN.1 10-1 


2 
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TABLE 16A 

Table 16A shows the accession numbers for those pkeys lacking unigenelD's for Table 16. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

104672 6735_7 AA349096 AI368018 F21390 F17759 R48772 AI421485 AI300352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263 
AI276281 R48205 AI245302 AI190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375 
H45809 F33447 M774528 AA007629 H42537 C01077 F32386 



263 



Attorney Docket No.:018501-005210US 



TABLE 17: Table 1 from BRCA 014 P 



Table 17 shows accession numbers representing 759 sequences of breast cancer genes or 
fragments thereof encoding breast cancer modulating proteins. For each overexpressed gene 
identified, a ratio of the relative amount of expression in breast tumors versus normal breast 
tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnfcjenelD: Unkjene number 

Unfcjene Title: Unkjene gene title 

R1 : Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigenelD Unigene Title R1 

100227 AV654694 Hs.82316 interferon-induced, hepatitis C-associat 3 

100405 AW291587 Hs.82733 nidogen2 3.2 

100406 AI962060 Hs.1 18397 AE-binding protein 1 3.6 
100420 086983 Hs.1 18893 Melanoma associated gene 3.2 
100911 X83300 Hs.289103 SMA4 5.2 
100960 J00124 Hs.1 17729 keratin 14 (epidermolysis bullosa simple 4.3 
101011 BE387036 Hs.1211 acid phosphatase 5, tartrate resistant 3 
101183 AA442324 Hs.795 H2A histone family, member O 3.2 
101194 L20971 Hs.188 phosphodiesterase 4B, cAMP-speciftc (dun 3 
101329 U66042 Hs.82171 Homo sapiens clone 191B7 placenta expres 4.1 
101378 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 5.3 
101474 R07566 Hs.73817 small inducible cytokine A3 (homologous 3.9 
101491 M25809 Hs.64173 ATPase, H+ transporting, lysosomal (vacu 4.5 
101530 M29874 Hs.1360 cytochrome P450, subfamily IIB (phenobar 9 
101602 AA353776 Hs.901 CD48 antigen (B-cell membrane protein) 3.4 
101663 NM_003528 Hs.2178 H2B histone family, member Q 5.6 
101758 BE019494 Hs.79217 pyrroline-5-carboxylate reductase 1 3.6 
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 12 
101817 M89907 Hs.152292 SWI/SNF related, matrix associated, acti 3.2 
101851 BE260964 Hs.82045 midkine (neurite growth-promoting factor 4.1 
101878 M97815 Hs.183650 cellular retinoic acid-binding protein 2 6.5 
102209 NM_002038 Hs.265827 interferon, alpha-inducible protein (do 3 
102214 - U23752 Hs.32964 SRY (sex determining region Y)-box 1 1 3 
102297 NMJ301504 Hs.198252 G protein-coupled receptor 9 3.7 
102299 NM_005824 Hs.155545 37 kDa leucine-rich repeat (LRR) protein 3.7 
102301 NM_005651 Hs.183671 tryptophan 2,3-dioxygenase 5.2 
102305 AL043202 Hs.90073 chromosome segregation 1 (yeast homolog) 3.5 
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 3.9 
102591 U62325 Hs.324125 amyloid beta (A4) precursor protein-bind 4 
102721 H16646 Hs.118666 hypothetical protein PP591 3.5 
102739 AA363025 Hs.1 55572 Human clone 23801 mRNA sequence 3.2 
102791 AF080229 gb:Human endogenous retrovirus K clone 1 3 
102804 NM_002318 Hs.83354 lysyl oxidase-like 2 3.2 
102903 M73779 Hs.250505 retinoic acid receptor, alpha 3.3 
103010 X52509 Hs.161640 tyrosine aminotransferase 12.4 
103042 T81656 Hs.252259 ribosomal protein S3 4.5 
103117 X63578 Hs.295449 paralbumin 3 
103207 X72790 gb:Human endogenous retrovirus mRNA for 5.9 
103282 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.9 
103284 AI751601 Hs.8375 TNF receptor-associated factor 4 3.3 
103329 X85134 Hs.72984 retinoblastoma-binding protein 5 3.1 
103364 X90872 Hs.279929 gp25L2 protein 3 
103385 NM_007069 Hs.37189 similartoratHREV107 3.4 
103456 AA496425 Hs.9629 papillary renal cell carcinoma (transloc 3.2 
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103498 Y09306 Hs.30148 homeodomain-interacting protein kinase 3 3.4 

103558 BE616547 Hs.2785 keratin 17 3.7 

103563 L02911 Hs. 1 50402 Activin A receptor, type I (ACVR1)(ALK 3.2 

103612 BE336654 Hs.70937 H3 histone family, member A 4.5 

103825 AI571835 Hs.55468 ESTs 4 

104073 AW779318 Hs.88417 ESTs 3.8 

104103 AW021102 Hs.21509 ESTs 4.3 

104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal 7.6 

104168 AA461618 Hs.31704 ESTs, Weakly similar to KIAA0227 [H.sapi 3.6 

104173 M084273 Hs.76561 ESTs, Weakly similar to S47072 finger pr 4 

104181 AF173296 Hs.283740 DC6 protein 3 

104189 AB040927 Hs.301804 KIM1494 protein 3.2 

104269 AI559444 Hs.293960 ESTs 4.3 

104307 AI929700 Hs.1 11680 endosulfine alpha 3.1 

104518 H20816 Hs.1 12423 Homo sapiens mRNA; cDNA DKFZp586l1420 (f 3.2 

104556 AV650851 Hs.96900 hypothetical protein; KIAA1 830 protein 4.4 

104658 AA360954 Hs.27268 Homo sapiens cDNA: FU21933 fis, clone H 3.2 

104748 AA015879 Hs.33536 ESTs 3.2 

104755 T49951 Hs.9029 DKFZP434G032 protein 4.5 

104825 AA035613 Hs.141883 ESTs 6.9 

104830 AW294092 Hs.21594 hypothetical protein MGC1 5754 11.1 

104865 T79340 Hs.22575 B-cell Cli/lymphoma 6, member B (zinc fi 3.5 

104906 BE298684 Hs.26802 protein kinase domains containing protei 6.5 

104961 H78517 Hs.33905 ESTs 3.6 

105038 AW503733 Hs.9414 KIAA1488 protein 4.5 

105088 H58589 Hs.35156 Homo sapiens CDNAFLJ11027 fis, clone PL 3.8 

105092 AA148982 Hs.29068 ESTs 3 

105093 AL1 37566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 4.8 
105304 AW134924 Hs.190325 ESTs 8.2 
105397 AA814807 Hs.7395 hypothetical protein FLJ23182 3.1 
105409 AW505076 Hs.301855 DiGeorge syndrome critical region gene 8 4.2 
105431 AA252033 Hs.242413 hypothetical protein DKFZp434K1 421 4.4 
105552 AA256750 Hs.28802 centaurin-alpha 2 protein 3.2 
105598 AA279439 Hs.279763 hypothetical protein FLJ 10504 3.5 
105650 W16741 Hs.25635 HSPC003 protein 3.7 
105688 AI299139 Hs.17517 ESTs 5.5 

105808 AI133161 Hs.286131 CGI-101 protein 3.5 

105809 AW973653 Hs.20104 hypothetical protein FLJ00052 3.3 
105909 AA195191 Hs.5111 hypothetical protein FLJ20729 3.2 
105965 AA131657 Hs.23830 ESTs 3.3 
106135 AL117474 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 3.2 
106184 W28948 Hs.10762 ESTs 3.3 
106293 N39842 Hs.301444 KIAA1673 4.1 
106400 BE397649 Hs.94109 Homo sapiens cDNA FLJ13634 fis, clone PL 3.1 
106474 BE383668 Hs.42484 hypothetical protein FLJ1 061 8 3.2 
106484 AA351978 Hs.4943 hepatocellular carcinoma associated prot 7.8 
106533 AL134708 Hs.145998 ESTs 3 
106614 AA648459 Hs.335951 hypothetical protein AF301222 3.8 
106636 AW958037 Hs.286 ribosomal protein L4 3.3 
106661 AW499914 Hs.7579 hypothetical protein FLJ10402 3 
106743 BE613328 Hs.21938 hypothetical protein FLJ12492 4.2 
106844 AA485055 Hs.158213 sperm associated antigen 6 3.4 

106864 AI311928 gb:qo89h04.x1 NCI_CGAP_Kid5 Homo sapiens 4.4 

106865 AW192535 Hs.19479 ESTs 3.6 
106871 AW472981 Hs.321130 hypothetical protein MGC2771 4.1 
106942 AA995351 Hs.31314 retinoblastoma-binding protein 7 3.6 
106968 AF216751 Hs.26813 CDA14 5.3 
107105 AW963419 Hs.155223 stanniocalcin 2 3.4 
107158 N32849 Hs.31844 hypothetical protein FLJ12586 3.1 
107248 AW263124 Hs.315111 nuclear receptor co-repressor/HDAC3 comp 5.9 
107265 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 3.9 
107630 AW961576 Hs.60178 ESTs 4.6 
107710 AI955040 Hs.265398 ESTs, Weakly similar to transformation-r 3 
107890 AA025386 Hs.61311 ESTs, Weakly similar to S1 0590 cysteine 3.1 
107985 T40064 Hs.71968 Homo sapiens mRNA; cDNA DKFZp564F053 (fr 4.8 
108000 AI263307 Hs.239884 H2B histone family, member L 3.3 
108217 AA058686 Hs.62588 ESTs 3.8 
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113117 


AW813731 


Hs. 159153 ESTs, Moderately similar to S65657 alpha 


3.4 


55 
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BE613410 


Hs.31575 SEC63, endoplasmic reticulum translocon 


3.2 
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Hs.10263 ESTs 


3.5 
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BE262470 


Hs.241471 RNB6 


6.2 
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Hs.269165 ESTs, Weakly similar to ALU1_HUMAN ALU S 


3.7 
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U54727 


Hs.191445 ESTs 
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60 


113494 


T91451 


Hs.86538 ESTs 


3.4 


113518 


AW367788 


Hs.323954 postmeiotic segregation increased 2-like 


3.1 




113571 


AI702609 


Hs.15713 hypothetical protein MGC2776 


3.1 




113822 


NM 004585 


Hs.1 7466 retinoic acid receptor responder (tazaro 


3.9 




113835 


AI912410 


Hs.27475 Homo sapiens cDNA FLJ12749 fis, clone NT 


3 


65 


113938 


W81598 


gb:zd88g02.s1 Soares JetaiJiearLNbHH 1 9W 


4.6 


113947 


W84768 


gb:zh53d03.s1 Soares_fetal_liver_spleen_ 


3.1 




113970 


W27249 


Hs.8109 hypothetical protein FLJ21080 


6.9 




114086 


AA378776 


Hs.288649 hypothetical protein MGC3077 


4.3 
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114148 AW470411 Hs.288433 neurotrimin 4.1 

114424 AW780192 Hs.267596 ESTs 3.4 

114518 AW1 63267 Hs.1 06469 suppressor of varl (S.cerevisiae) 3-like 3.1 

114563 AI979168 Hs.82226 glycoprotein (transmembrane) nmb 4.8 

5 114965 AI733881 Hs.72472 BMP-R1B 10.1 

114995 AA769266 Hs. 193657 ESTs 3.6 

115121 AI634549 Hs.88155 ESTs 3.2 

115134 AW968073 Hs.194331 ESTs, Highly similar to A5571 3 inositol 4.2 

115167 AA749209 Hs.43728 hypothetical protein 3 

10 115253 BE149845 Hs.289038 hypothetical protein MGC4126 3.6 

115277 AA814100 Hs.86693 ESTs 3.9 

115327 N46436 Hs.109221 ESTs 3.4 

115354 AA281636 Hs.334827 ESTs 4.8 

115657 AA405620 Hs.55158 ESTs, Weakly similar to T29520 hypotheti 3.5 

15 115676 AA953006 Hs.88143 ESTs 9.3 

115709 AW293849 Hs.58279 ESTs, Weakly similar to ALU7 HUMAN ALUS 3.4 

115729 AA417812 Hs.38775 ESTs 4 

H : 115787 AI126772 Hs.40479 ESTs 3.1 

Q 115830 AW970529 Hs.86434 hypothetical protein FLJ21 81 6 3.6 

=20 115835 AA521410 Hs.41371 ESTs 3.1 

115850 NMJ514937 Hs.52463 KIAA0966 protein 3 

if! 115900 AK001500 Hs.165186 hypothetical protein FLJ 13852 3.2 

ff\ 115935 AA354549 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 3 

g.'. 115948 AL042465 Hs.43445 poly(A)-specific ribonuclease (deadenyla 3.1 

3^25 116092 AB041035 Hs.93847 NM_016931:Homo sapiens NADPH oxidase 4 ( 6.7 

=U 116115 AL042355 Hs.70202 WD repeat domain 10 3.6 

f*-. 116184 AW450737 Hs.128791 CGI-09 protein 3.1 

116192 AA464976 Hs.62528 ESTs, Moderately similar to A46010 X-lin 3.3 

e 116208 AI219083 Hs.42532 ESTs, Moderately similar to ALU8_HUMAN A 3.2 

f=S0 116246 AF265555 Hs.250646 baculoviral IAP repeat-containing 6 3.6 

" 116443 AW962196 Hs.321264 LBP protein 32 4.1 

fT 116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 4.1 

fLi 116726 AK001114 Hs.53913 hypothetical protein FLJ10252 8.6 

F- 116845 AA649530 gb:ns44f05.s1 NCI_CGAP_Alv1 Homo sapiens 3.2 

235 117026 H88256 Hs.50456 ESTs, Moderately similar to ZN75_HUMAN Z 3.5 

W 117216 AI569804 Hs.42792 ESTs, Weakly similar to I78885 serine/th 3.1 

fU 117296 AL133427 Hs.42506 Homo sapiens mRNA full length insert cDN 3.2 

117403 H84455 Hs.40639 ESTs 4.7 

117691 AB040959 Hs.93836 DKFZP434N014 protein 3 

40 118229 AW968941 Hs.166254 hypothetical protein DKFZp566l133 3.3 

118363 AI183838 Hs.48938 hypothetical protein FLJ21802 4.3 

118416 N66028 Hs.49105 FKBP-associated protein 3.1 

118470 AW970584 Hs.291033 ESTs 3.4 

118502 AL157488 Hs.50150 Homo sapiens mRNA; cDNA DKFZp564B182 (fr 5.2 

45 118695 AK000465 Hs.50081 KIAA1 199 protein 3.4 

118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8JHUMAN A 3.3 

119025 BE003760 Hs.55209 Homo sapiens mRNA; cDNA DKFZp434K0514 (f 19.7 

119036 R95872 Hs.117572 chemokine binding protein 2 3.7 

119063 R16833 Hs.53106 ESTs, Moderately similar to ALU1.HUMAN A 4.1 

50 119075 M10905 Hs.287820 fibronectin 1 3.2 

119620 W47620 Hs.56009 ^'-oligoadenylate synthetase 3 (100 k 3.3 

119741 AF041853 Hs.43670 kinesin family member 3A 3.1 

119747 AI970797 Hs.64859 ESTs 5 

119754 AL037824 Hs.194695 ras homolog gene family, member I 3.8 

55 119905 AW449064 Hs.1 19571 collagen, type III, alpha 1 (Ehlers-Danl 3.1 

120084 W94472 Hs.59529 ESTs, Moderately similar to ALU 1.HUMAN A 8.4 

120241 AA825686 Hs.321176 ESTs, Weakly similar to S65824 reverse t 3.6 

120326 AA196300 Hs.21145 hypothetical protein RG083M05.2 3.2 

120742 AA225084 gb:nc21d06.r1 NCI_CGAP_Pr1 Homo sapiens 3.6 

60 120870 AA357172 Hs.292581 ESTs, Moderately similar to ALU1_HUMAN A 5.8 

120885 AA365515 Hs.301872 hypothetical protein MGC4840 3 

120970 AA398118 Hs.97579 ESTs, Weakly similar to A46010 X-linked 3.7 

121054 AW976570 Hs.97387 ESTs 5.3 

121095 AA320134 Hs.196029 Homo sapiens mRNA for KIAA1 657 protein, 4 

65 121103 AA398936 Hs.97697 EST 3.5 

121121 AA399371 Hs.189095 similar to SALL1 (sal (Drosophila)-like 6.3 

121337 AW885727 Hs.301570 ESTs 4.7 
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Hs.82772 collagen, type XI, alpha 1 
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Hs.99598 hypothetical protein MGC5338 
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Hs.16601 1 catenin (cadherin-associated protein), d 
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Hs.188691 Homo sapiens cDNA FLJ12033 fis, clone HE 


3.8 
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4.1 
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Hs.139315 Homo sapiens cDNA: FLJ21479 fis, clone C 


3.6 




124567 


AW451645 


Hs.151504 Homo sapiens cDNA FLJ11973 fis, clone HE 


3.1 




124911 


N34151 


Hs. 1 74 1 95 interferon induced transmembrane protein 


3.5 
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Hs.101774 hypothetical protein FLJ23045 


4.3 
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BE065136 


Hs.145696 splicing factor (CC1. 3) 
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Hs.269432 ESTs, Moderately similar to ALU1_HUMAN A 


8.1 
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Hs.288684 Homo sapiens cDNA FLJ1 1750 fis, clone HE 


4.7 
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Hs.1 05413 ESTs 


3.1 
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Hs.187561 ESTs, Moderately similar to ALU1 JHUMAN A 


3.3 
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125304 


AL359573 


Hs.124940 GTP-binding protein 


3 


=|35 


125330 


AW880562 


Hs.1 14574 ESTs 


3 


jae. 


125331 


AI422996 


Hs.1 61 378 ESTs 


3.2 


fl 1 


125685 


AI924630 


Hs.4943 hepatocellular carcinoma associated prot 


3.2 




126257 


N99638 


gb:za39g1 1,r1 Soares fetal liver spleen 


4 


40 


126474 


AW975814 


Hs.326714 Homo sapiens clone IMAGE:713177, mRNA se 


4 


126666 


AA648886 


Hs.151999 ESTs 


3.8 




126872 


AW450979 


gb:UI-H-BI3-ala-a-12-0-Ul.s1 NCI_CGAP_Su 


3 




127431 


AW771958 


Hs. 1 75437 ESTs, Moderately similar to PC4259 ferri 


3.6 




127980 


AA961459 


Hs.1 25644 ESTs 


4.1 


45 


127997 


AW068311 


Hs.31 1054 Homo sapiens mRNA full length insert cDN 


3.3 


128420 


AA650274 


Hs.41296 fibronectin leucine rich transmembrane p 


4.6 




128609 


NM_003616 


Hs. 102456 survival of motor neuron protein interac 


3.9 




128946 


Y13153 


Hs. 107318 kynurenine 3-monooxygenase (kynurenine 3 


3.1 




128955 


AA775076 


Hs.185807 Homo sapiens, Similar to PRO0478 protein 


3.9 


50 


129092 


D56365 


Hs.63525 poly(rC)-binding protein 2 


3.3 


129270 


AA357185 


H^s 109918 ra^ hnmnkvi npnp femilv mpmhprH 


3.1 




129301 


AF1 82277 


Hs 330780 cytochrome P450 subfamilv IIB fDhenobar 


3.9 




129385 


AA172106 


Hs 110QSO Ran P nmtein 






129619 


AA209534 


Hs.284243 tetraspan NET-6 protein 


3.4 


55 


129629 


AK000398 


Hs.1 1747 hypothetical protein FLJ20391 


3 


129725 


X56411 


Hs.1219 alcohol dehydrogenase 4 (class II), pi p 


3.2 




130069 


AI754813 


Hs.146428 collagen, type V, alpha 1 


5.4 




130092 


X03363 


Hs.323910 v-erb-b2 avian erythroblastic leukemia v 


4.4 




130298 


AI347487 


Hs.132781 class I cytokine receptor 


4.6 


60 


130382 


NM 003450 


Hs.1 55204 zinc finger protein 1 74 


5.6 


130622 


AI582291 


Hs.1 6846 ESTs, Weakly similar to 04HUD1 debrisoqu 


3 




130703 


R77776 


Hs.18103 ESTs 


3.8 




130881 


AA809875 


Hs.25933 ESTs 


4.2 




130954 


AB014544 


Hs.21 572 KIAA0644 gene product 


4.7 


65 


131095 


AI399653 


Hs.22917 ESTs 


4.3 


131153 


H09048 


Hs.23606 ESTs 


3.8 




131253 


R71802 


Hs.24853 ESTs 


3.5 




131372 


AW293399 


Hs.144904 nuclear receptor co-repressor 1 


3.6 
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131507 AI826268 Hs.27769 ESTs, Weakly similar to MCATJHUMAN MITOC 3.2 

131587 AI695549 Hs. 183868 glucuronidase, beta 3.1 

131739 AF017986 Hs.31386 secreted frizzled-related protein 2 3.2 

131795 BE501849 Hs.32317 high-mobility group 20B 3.2 

5 131970 D86960 Hs.3610 KIAA0205 gene product 3.6 

131986 NM_002314 Hs.36566 LIM domain kinase 1 3.2 

132093 AA400091 Hs.39421 ESTs 3.2 

132122 AA426202 Hs.40403 Cbp/p300-interacting transactivator, wit 3 

132159 D76435 Hs.41154 Zic family member 1 (odd-paired Drosophi 3.2 

10 132333 M192669 Hs.45032 ESTs 3.5 

132406 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 9.2 

132482 AV660345 Hs.238126 CGI49 protein 8.2 

132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 3.4 

132624 M326108 Hs.33829 bHLH protein DEC2 3.2 

15 132700 AA319233 Hs.5521 ESTs 4.8 

132725 NM_006276 Hs.184167 splicing factor, arginine/serine-rich 7 3.6 

132799 W73311 Hs.1 69407 SAC2 (suppressor of actin mutations 2, y 3.2 

M' 132847 T48195 Hs.58189 eukaryotic translation initiation factor 3.5 

ff; 132857 Y00272 Hs.184572 cell division cycle 2, G1 to Sand G2 to 4.4 

f«20 132936 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 4.8 

133130 AI128606 Hs.6557 zinc finger protein 161 3.3 

yl 133142 AW952412 Hs.65874 ESTs, Weakly similar to A40348 Elav/Sex- 3.5 

ff\ 133167 AW162840 Hs.6641 kinesin family member 5C 4.5 

133225 AW600291 Hs.6823 hypothetical protein FU 10430 3.3 

"E5 133274 AA085191 Hs.6949 hypothetical protein MGC1 1275 3 

*%i 133275 Z93241 Hs.239934 CGI-96 protein 4.5 

p; 133287 AW797437 Hs.69771 B-factor, properdin 4.1 

133376 BE618768 Hs.7232 acetyl-Coenzyme A carboxylase alpha 5.1 

« 133462 AW675064 Hs.73875 fumarylacetoacetate hydrolase (fumarylac 3 

C30 133740 AW162919 Hs.170160 RAB2, member RAS oncogene family-like 3.4 

l-. 133831 BE274552 Hs.76578 protein inhibitor of activated STAT3 3.9 

133976 AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor 6.2 

PJ 134666 BE391929 Hs.8752 transmembrane protein 4 3.1 

JF 134710 AI433797 Hs.8889 serine hydroxymethyltransferase 1 (solub 3 

335 134731 D89377 Hs.89404 msh (Drosophila) homeo box homolog 2 5.8 

W 134776 J05582 Hs.89603 mucin 1 , transmembrane 4 

fli 135230 AF064804 Hs.96757 suppressor of Ty (S.cerevisiae) 3 homolo 3.2 

135303 R61253 Hs.98265 KIAA 1877 protein 3.3 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 4.8 

40 135411 L10333 Hs.99947 reticulon 1 3.8 

300089 AI199738 Hs.208275 ESTs, Weakly similar to ALUA_HUMAN !!!! 3.8 

300233 AW614220 Hs.189402 ESTs 4.2 

300254 AW183618 Hs.55610 solute carrier family 30 (zinc transport 9.9 

300256 AW591433 Hs.298241 Transmembrane protease, serine 3 4.9 

45 300378 Z45270 Hs.235873 hypothefical protein FLJ22672 3.4 

300973 AA572949 Hs.207566 ESTs 3.5 

301111 R10799 Hs.191990 ESTs 3.8 

301341 M887801 Hs.208229 G protein-coupled receptor 13.9 

301548 AI091631 Hs.203845 two pore potassium channel KT3. 3 4.4 

50 301884 AA312082 Hs.105445 GDNF family receptor alpha 1 5.7 

301936 U79745 Hs.1 14924 solute carrier family 1 6 (monocarboxylic 8.6 

301976 T97905 gb:ye54c10.r1 Soares fetal liver spleen 3.9 

302001 AB020711 Hs.278346 KIAA0904 protein 7.7 

302067 BE542706 Hs.222399 CEGP1 protein 7.3 

55 302094 AW749321 Hs.6786 ESTs 3.3 

302099 AL049670 Hs.137576 ribosomal protein L34 pseudogene 1 4.2 

302145 NM_003613 Hs.151407 cartilage intermediate layer protein, nu 7.9 

302235 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F1 12 (fr 5.6 

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 5.4 

60 302290 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 34.1 

302372 AL1 17406 Hs.200102 ATP-binding cassette transporter MRP8 6.7 

302378 AL109712 Hs.296506 Homo sapiens mRNA full length insert cDN 4 

302384 AI678059 Hs.202676 synaptonemal complex protein 2 4.3 

302385 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 13.8 
6 5 302680 AW192334 Hs.38218 ESTs 9.6 

302830 AI038997 Hs.132921 ESTs 5 

302857 AF282265 Hs.44836 inner centromere protein antigens (135kD 3.4 
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302892 AW176909 Hs.42346 calcineurin-binding protein calsarcin-1 3.4 

302970 W05608 Hs.312679 ESTs, Weakly similar to A49019 dynein he 5.1 

303271 AA652687 Hs.96151 Human DNA sequence from clone RP5-1103G7 3.7 

303289 AL121460 Hs.272673 hypothetical protein FLJ20508 4.1 

303357 AW006352 Hs. 159643 ESTs, Weakly similar to T32554 hypotheti 4.2 

303540 AA355607 Hs.309490 ESTs, Weakly similar to putative WHSC1 p 4.3 

303563 AA367699 Hs. 10082 potassium intermediate/small conductance 3.3 

303642 AW299459 gb:xs50d08.x1 NCI_CGAP_Kid1 1 Homo sapien 4.2 

303780 AI424014 Hs.18995 KIAA1304 protein 3.6 

303797 AW629759 gb:hh70e05.y1 NCI_CGAP_GU1 Homo sapiens 4.9 

303852 R53434 Hs.90207 hypothetical protein MGC1 11 38 3.7 

304328 AA149951 Hs.62112 zinc finger protein 207 3 

304782 AA582081 gb:nn32h08.s1 NCI_CGAP_Gas1 Homo sapiens 4.1 

305913 AA876109 gb:nx24h01.s1 NCI_CGAP_GC3 Homo sapiens 3 

305917 AA876469 gb:oe48b04.s1 NCI_CGAP_Pr25 Homo sapiens 3.1 

307010 AI140014 gb:qa68f09.x1 Soares_fetal_hearLNbHH19W 3.5 

307041 AI144243 gb:qb85b12.x1 Soares_fetai_heart_NbHH19W 3.9 

308106 AI476803 gb:tj77e12.x1 Soares_NSF_F8_9W_OT_PA_P_S 4.3 

308307 AI581398 Hs.172928 collagen, type I, alpha 1 4.6 

308615 AK000142 Hs.101774 hypothetical protein FU23045 4.4 

309177 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 17.3 

309328 AW024348 Hs.233191 EST, Weakly similar to A2721 7 glucose tr 3.2 

309574 AW168083 gb:xg59g04.x1 NCI_CGAP_Ut4 Homo sapiens 3.1 

309583 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 57.6 

310064 AI199712 Hs.148486 ESTs, Weakly similarto 1917210A Pro/Arg 4.6 

310098 AI685841 Hs.161354 ESTs 3.6 

310438 AW022192 Hs.200197 ESTs 4.6 

310683 AI939456 Hs.160870 ESTs 3.2 

310727 AK000703 Hs.323822 Homo sapiens mRNA for KIAA1551 protein, 3.6 

310781 AI380797 Hs.158992 ESTs 10.2 

310895 AI955121 Hs.165724 N-acetylga!actosamine-4-0-sulfotransfera 3.4 

310955 AI476732 Hs.263912 ESTs 10.9 

311117 AI671439 Hs.196029 Homo sapiens mRNA for KIAA1657 protein, 3.1 

311166 AI821005 Hs.118599 ESTs 10.8 

311237 AA641098 Hs.208809 ESTs, Moderately similarto ALU1_HUMAN A 4.3 

311465 AI758660 Hs.206132 ESTs 4.4 

311587 AI828254 Hs.271019 ESTs, Weakly similar to A47582 B-cell gr 5.1 

311598 AW023595 Hs.232048 ESTs 5.8 

311774 AA700870 Hs.14304 ESTs 3.3 

311785 AI056769 Hs.133512 ESTs 3.9 

311872 R12375 Hs.194600 ESTs 3.3 

311889 AA767342 Hs.122483 ESTs, Weakly similar to PSF_HUMAN PTB-AS 3 

311913 AI358522 Hs.270188 ESTs 3 

311923 T60843 Hs.189679 ESTs 5.6 

311935 AA216387 gb:nc16b02.s1 NCI_CGAP_Pr1 Homo sapiens 5.2 

312019 AA373630 Hs.188750 ESTs 3 

312021 AA759263 Hs.14041 ESTs 3.4 

312067 T78968 Hs.14411 ESTs 3.5 

312090 T80177 Hs.118064 similar to rat nuclear ubiquitous casein 3.8 

312147 AI633744 Hs.195648 ESTs, Weakly similar to I38022 hypotheti 4.4 

312153 BE261944 Hs.118625 hexokinasel 5.2 

312168 T92251 Hs.198882 ESTs 3.3 

312182 T94344 Hs.326263 ESTs 3.3 

312187 AA700439 Hs.188490 ESTs 3.4 

312199 AW438602 Hs.191179 ESTs 3.9 

312219 H73505 Hs.1 17874 ESTs 4 

312226 AA315703 Hs.199993 ESTs, Weakly similar to ALUB.HUMAN !!!! 4.9 

312299 AA972712 Hs.269737 ESTs 5.7 

312544 AA516420 Hs.183526 ESTs, Weakly similar to I38022 hypotheti 6.3 

312638 AW439195 Hs.256880 ESTs, Weakly similar to S65657 alpha-1 C- 4.9 

312826 AW291545 Hs.185018 ESTs 4.9 

312837 AW292286 Hs.255058 ESTs 4.4 

312980 AA497043 Hs.1 15685 ESTs 3.1 

313070 AI422023 Hs.161338 ESTs 4.3 

313079 N76497 Hs.1787 proteolipid protein 1 (Pelizaeus-Merzbac 3.3 

313089 AF026944 Hs 293797 ESTs 5 8 

313096 AW073310 Hs!l63533 Homo sapiens cDNAFLJ14142fis, clone MA 4^5 
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313126 AA746503 Hs.283313 ESTs 10 

313166 AI801098 Hs.151500 ESTs 3.5 

313197 AW979008 Hs.222487 ESTs 3.3 

313280 AW960454 Hs.222830 ESTs 4.7 

5 313325 AI420611 Hs.127832 ESTs 3.4 

313328 AW449211 Hs.105445 GDNF family receptor alpha 1 12.4 

313352 AW150945 Hs.144758 ESTs 4.1 

313385 AI032087 Hs.269819 ESTs 3 

313393 AI674685 Hs.200141 ESTs 5.2 

10 313417 AA741151 Hs.137323 ESTs 3.5 

313434 W92070 gb:zh48g05.r1 Soares_fetal_liver_spleen_ 3.7 

313569 AI273419 Hs.135146 hypothetical protein FLJ 13984 3 

313591 AA046309 gb:zf12f01.s1 Soares_fetal_heart_NbHH19W 5.6 

313615 AI540978 Hs.301997 hypothetical protein FLJ13033 3.2 

15 313915 C18863 Hs.163443 Homo sapiens cDNAFLJ1 1576 fis, clone HE 26.3 

313975 AW175896 Hs.65114 keratin 18 3 

313979 AI535895 Hs.221024 ESTs 4.9 

y. 313997 AV657317 Hs.288649 hypothetical protein MGC3077 3.9 

314043 M827082 Hs.291872 ESTs 3.1 

i^O 314078 AW129357 Hs.329700 ESTs 8.3 

B 314097 AA648744 Hs.269493 ESTs 6.6 

sir; 314121 AI732083 Hs.187619 ESTs 6.2 

fj 314129 AA228366 Hs.1 15122 ESTs 4 
W 314138 AA740616 gb:ny97f11.s1 NCI_CGAP_GCB1 Homo sapiens 5.9 

' 25 314236 M743396 Hs.1 89023 ESTs 3.1 

314244 AL036450 Hs.103238 ESTs 4 

314305 AI280112 Hs.125232 Homo sapiens CDNAFU13266 fis, clone OV 8 

|I! 314306 AI697901 Hs.192425 ESTs 3.7 

314322 AA907153 Hs.190060 ESTs 3.3 

1,30 314394 AW961597 Hs.130816 ESTs, Moderately similar to I38022 hypot 4.2 

fc= ! 314401 AI660412 Hs.234557 ESTs 3.3 

M= 314465 AA602917 Hs.156974 ESTs 4.7 

U i 314506 AA833655 Hs.206868 Homo sapiens cDNA FLJ 14056 fis, clone HE 8.5 

' r: 314510 AI204418 Hs.190080 ESTs 4 

m5 314546 AW007211 Hs.16131 hypothetical protein FLJ12876 3.4 

p 314547 AA399272 Hs.144341 ESTs 6.7 

fTs 314558 AI873274 Hs.190721 ESTs 27.4 

» y - 314627 AA425310 Hs.155766 ESTs, Weakly similar to A47582 B-cellgr 4.4 
314648 AW979268 gb:EST391 378 MAGE resequences, MAGP Homo 4.6 

40 314691 AW207206 Hs.136319 ESTs 20.7 

314729 AA457367 Hs.191638 ESTs 3.6 

314754 AW026761 Hs.134374 ESTs 3.6 

314814 BE350122 Hs.157367 ESTs, Weakly similar to I78885 serine/th 4.9 

314864 AW971198 Hs.294068 ESTs 4.3 

45 314881 AI095087 Hs.152299 ESTs, Moderately similar to S65657 alpha 3.7 

314882 AA828032 Hs.189076 ESTs 3.1 

314981 AW972359 Hs.293334 ESTs 3 

315006 AI538613 Hs.298241 Transmembrane protease, serine 3 10.9 

315021 AA533447 Hs.312989 ESTs 5.3 

50 315051 AW292425 Hs.163484 ESTs 12.9 

315060 AA551 104 Hs.189048 ESTs, Moderately similar to ALUC.HUMAN I 5.8 

315073 AW452948 Hs.257631 ESTs 4.2 

315080 AA744550 Hs.136345 ESTs 3.7 

315175 AI025842 Hs.152530 ESTs 6 

55 315183 AW136134 Hs.220277 ESTs 3.9 

315193 AI241331 Hs.131765 ESTs, Moderately similar to I38937 DNA/R 4.4 

315196 AI367347 Hs.44898 Homo sapiens clone TCCCTA00151 mRNAsequ 8.2 

315198 AI741506 Hs.165900 ESTs, Weakly similar to ALU1_HUMAN ALUS 3.6 

315240 R38772 Hs.172619 myelin transcription factor Mike 3.4 

60 315263 AW510994 Hs.220740 ESTs 3.4 

315282 AI222165 Hs.144923 ESTs 4.9 

3*5296 AA876905 Hs.125286 ESTs 4 

315368 AB037745 Hs.104696 KIAA1 324 protein 4.7 

315397 AA218940 Hs.137516 fidgetin-like 1 3.1 

65 315489 A1378817 Hs.191847 ESTs 3.1 

315498 AA628539 Hs.116252 ESTs, Moderatelysimilarto ALU1_HUMAN A 3.2 

315526 AI193043 Hs.128685 ESTs, Weakly similar to T1 7226 hypotheti 4.1 
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315530 AW015415 Hs.127780 ESTs 8.9 

315562 M737415 Hs.152826 ESTs 5.5 

315634 M837085 Hs.220585 ESTs 6.3 

315647 M648983 Hs.212911 ESTs 3.6 

315707 AI418055 Hs.161160 ESTs 5.1 

315772 AW515373 Hs.271249 Homo sapiens cDNA FLJ 13580 fis, clone PL 3.1 

315850 AW270550 Hs.1 16957 ESTs 3.8 

315858 M737345 Hs.294041 ESTs 5 

315878 AA683336 Hs.189046 ESTs 3.1 

315977 AW865916 Hs.151206 ESTs 4.7 

315978 AA830893 Hs.1 19769 ESTs 4.1 
315995 AI217477 Hs.194591 ESTs 4.1 
316012 AA764950 Hs.1 19898 ESTs 7 
316042 AI469960 Hs.170698 ESTs 4.9 
316052 AI962796 Hs.136754 ESTs 4.1 
316072 AW517524 Hs.135201 NOD2 protein 3.2 
316074 AW975114 Hs.293273 ESTs 3.8 
316100 AW203986 Hs.213003 ESTs 3.2 
316133 AI187742 Hs.125562 ESTs 3.7 
316177 AI904982 Hs.293102 ESTs, Moderately similar to ALU1_HUMAN A 30.7 
316186 AI433540 gb:ti69g05.x1 NCI_CGAP_Kid11 Homosapien 3.1 
316244 AI640761 Hs.224988 ESTs 3.5 
316303 AA740994 Hs.209609 ESTs 3.8 
316313 AA741300 Hs.202599 ESTs, Weakly similar to I38022 hypotheti 4.4 
316364 AA747807 Hs.149500 ESTs 3.2 
316580 AA938198 Hs.146123 poly(A) polymerase gamma 9.4 
316697 AW293174 Hs.252627 ESTs 4.4 
316715 AI440266 Hs.170673 ESTs, Weakly similar to T24832 hypotheti 3 

316868 AI660898 Hs.195602 ESTs 3.2 

316869 AI954880 Hs.134604 ESTs 3.2 
316886 AA836331 Hs.134981 ESTs 4.4 
316897 AA838114 Hs.221612 ESTs 3.7 
316943 AW014875 Hs.137007 ESTs 4.6 
317069 AI732892 Hs.190489 ESTs 5.9 
317194 AW445167 Hs.126036 ESTs 4.1 
317360 AI125252 Hs.126419 ESTs 3.5 
317404 A1806867 Hs.126594 ESTs 5.1 
317452 AA972965 Hs.135568 ESTs 6.9 
317501 A1822034 Hs.137097 ESTs 4.6 
317674 AW294909 Hs.132208 ESTs 4.3 
317803 AW664964 Hs.128899 ESTs 6.1 
317834 X56348 Hs.287270 ret proto-oncogene (multiple endocrine n 3.1 
317850 AI681545 Hs.152982 hypothetical protein FLJ131 17 3.4 
317881 AI827248 Hs.224398 Homo sapiens cDNAFLJ1 1469 fis, clone HE 9.6 
317902 AW102941 Hs.211265 ESTs 4.1 
317916 AI565071 Hs.159983 ESTs 10.3 
318042 AW294522 Hs.149991 ESTs 3.1 
318223 AI077540 Hs.134090 ESTs 3.9 
318327 AW294013 Hs.200942 ESTs 3 
318332 AI093930 Hs.163440 Homo sapiens cDNA: FU21000 fis, clone C 4.4 
318418 AF107493 Hs.118498 Homo sapiens LUCA-1 5 protein mRN A, splic 5.4 
318558 AW402677 Hs.146381 RNA binding motif protein, X chromosome 4.4 
318625 AA526235 Hs.193162 Homo sapiens cDNA FU1 1983 fis, clone HE 5.9 
318634 T49598 Hs.1 56832 ESTs 4 
318740 NM_002543 Hs.77729 oxidised low density lipoprotein (lectin 7.3 
318744 AI793124 Hs.144479 ESTs 17.8 
318781 F11802 Hs.6818 ESTs 3 
319191 NM_012391 Hs.79414 prostate epithelium-specific Ets transcr 3.6 
319478 AI524124 Hs.270307 ESTs 4.6 
319510 W88532 Hs.254562 ESTs 3.3 
319551 AA761668 gb:nz24c08.s1 NC1_CGAP_GCB1 Homo sapiens 3.2 
319745 T79366 Hs.1 08258 actin binding protein; macrophin (microf 3.3 
319834 AA071267 gb:zm61g01.r1 Stratagene fibroblast (937 6.2 
319840 C19035 Hs.1 64259 ESTs 3.3 
319977 AA534222 gb:nj21d02.s1 NCI_CGAP_AA1 Homo sapiens 4.3 
320074 AA321166 Hs.278233 ESTs 3.4 
320167 AA984373 Hs.90790 Homo sapiens cDNA: FLJ22930 fis, clone K 4.1 
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320187 
320211 
320416 
320588 
320635 
320654 
320742 
320832 
320915 
321016 
321107 
321171 
321253 
321318 
321642 
321644 
321683 
321758 
321811 
321828 
321910 
321937 
321978 
322035 
322136 
322258 
322296 
322303 
322476 
322520 
322521 
322567 
322595 
322675 
322766 
322818 
322882 
322975 
323091 
323131 
323168 
323244 
323262 
323332 
323333 
323335 
323645 
323663 
323693 
323782 
323817 
323930 
323974 
324001 
324036 
324261 
324285 
324296 
324305 
324432 
324585 
324598 
324603 
324631 
324716 
324748 
324771 



T99949 

AL039402 

AI026984 

U78082 

N50617 

AI160015 

AI601188 

AA214584 

AI359144 

BE144167 

AI732643 

AI769410 

AA610649 

AB033041 

AI432199 

AW975944 

AI471598 

U29112 

D80630 

R59890 

H67065 

AL049351 

N77342 

AL137517 

AF075083 

BE265745 

W76326 

AI357412 

AW963372 

T55958 

AF147347 

AF155108 

W92147 

AA017656 

AW068805 

AW043782 

AW248508 

C16391 

AI902456 

AK002088 

AL120862 

AW675572 

AL133990 

AI829520 

AV651680 

AI655499 

AW445014 

BE081058 

AA317962 

AW961560 

AA410943 

AL043683 

AI825204 

AL044949 

AI472078 

BE069341 

AA431159 

AI524039 

AA642007 

AA464510 

AI823969 

AW972227 

AW993522 

M937116 

BE169746 

AW974941 

M631739 



Hs.303428 

Hs.125783 

Hs.293662 

Hs.1 67738 

Hs.80506 

Hs.118112 

Hs.120910 

Hs.290167 

Hs.143688 

Hs.49994 

Hs.144151 

Hs.221461 

Hs.333239 

Hs.1 37507 

Hs.247084 

Hs.237396 

Hs.197531 

Hs.196151 

Hs.83623 

Hs.271530 

Hs.302058 

Hs.21851 

Hs.334473 

Hs.194359 

Hs.157601 
Hs.46677 



Hs.256150 
Hs.118394 

Hs.288467 
Hs.293616 
Hs.279727 

Hs.210761 
Hs.270124 
Hs.124165 
Hs.1 93620 
Hs.190642 

Hs.208558 
Hs.161712 
Hs.1 97746 
Hs.243023 
Hs.249721 
Hs.97600 

Hs.8173 
Hs.211408 
Hs.1 16298 
Hs.303662 

Hs.122954 
Hs.1 92524 
Hs.1 16369 
Hs.1 5281 2 
Hs.1 32678 
Hs.1 63986 
Hs.292934 
Hs.293683 
Hs.12504 
Hs.292385 
Hs.335440 



Homo sapiens cDNA FLJ14832 fis, clone OV 5.3 

DEME-6 protein 9.2 

ESTs 3.1 

RNA polymerase II transcriptional regula 3.1 

small nuclear ribonucleoprotein polypept 6.1 

ESTs 3.5 

ESTs 3 

ESTs 3.7 

Homo sapiens cDNA: FLJ23031 fis, clone L 3.1 

hypothetical protein similar to RNA-bind 3.3 

ESTs 12.3 

ESTs 3.3 

ESTs 3 

vang (van gogh, Drosophila)-like 2 3.9 

ESTs 3 

ESTs 11.7 

ESTs 3.8 

ESTs 4.4 

gb:HUM091D02B Human fetal brain (TFujiwa 3.2 

nuclear receptor subfamily 1, group I, m 3.1 

ESTs, Weakly similar to ALU7_HUMAN ALU S 4.7 
Homo sapiens mRNA; cDNA DKFZp566C093 (fr 3.5 

Homo sapiens cDNA FLJ1 2900 fis, clone NT 5 

hypothetical protein DKFZp56401 278 1 9 

gb:Homo sapiens full length insert cDNA 3.6 

ESTs, Weakly similar to ALUCJHUMAN !!!! 3 

gb:zd60d04.r1 Soares_fetal_heart_NbHH19W 4.4 

ESTs 11.5 

PRO2000 protein 3 

gb:yb35f05.r1 Stratagene fetal spleen (9 3 

gb:Homo sapiens full length insert cDNA 4.2 

Homo sapiens, Similar to RIKEN cDNA 28 1 0 4 

ESTs 5.4 

gb:ze39h01.M Soares retina N2b4HR Homo 3.1 

Homo sapiens cDNA FU1 2280 fis, clone MA 5.2 

ESTs 7.6 

Homo sapiens cDNA FLJ14035 fis, done HE 5.9 

gb:C16391 Clontech human aorta polyA mRN 16.5 

ESTs, Weakly similar to 138022 hypotheti 4 

Homo sapiens cDNA FLJ11226 fis, clone PL 3.3 

programmed cell death 9 (PDCD9) 6.3 

ESTs 4.6 

ESTs 10.5 

gb:wl19c06.x1 NCI_CGAP_Ut1 Homo sapiens 6.2 

ESTs 4.3 

ESTs 9.2 

ESTs 3.1 

ESTs 4 

ESTs, Moderately similar to PC4259 fern 3 

ESTs 3.2 

BMP-R1B 8.4 

hypothetical protein FLJ 1 0803 3.3 

ESTs 4.5 

ESTs 4.5 

ESTs 8.4 
gb:QV3-BT0381-270100-073-o08 BT0381 Homo 49.4 

ESTs 3 

ESTs 3 

ESTs 3.3 

ESTs 16.5 

ESTs 3.3 

Homo sapiens cDNA: FLJ22765 fis, clone K 5 

ESTs 10.4 

ESTs, Weakly similar to I54374 gene NF2 3.3 

likely ortholog of mouse Arkadia 3.2 

ESTs, Weakly similar to I78885 serine/th 3 

EST 3 
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» 

111 

m 
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HI 
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e30 



40 



45 



50 



55 



60 



65 



324774 
324823 
324824 
324826 
324961 
324987 
324994 
325146 
325372 
325544 
327075 
332798 
334223 
334447 
335809 
335824 
338255 
409430 
428046 
432558 
436808 
448569 
453542 



AI031771 

AW516704 

A1826999 

M704806 

AA613792 

AI375572 

AI805416 

AI064690 



R21945 

AW812795 

R97268 

AA731602 

BE382657 

AW836724 

M97935 

M97935 

M55150 

M13755 

AI052047 

AA252033 

AA401739 

H18459 

R48744 

M31682 

AA416873 

D80240 

R49590 



X03363 



Hs.132586 
Hs.208726 
Hs.224624 
Hs.143842 

Hs.1 72634 
Hs.213897 
Hs.171176 



Hs.166975 
Hs.155381 
Hs. 177269 
Hs. 120266 
Hs.21486 
Hs.339660 



AA034918 



AF049569 
M13955 



HG4126HT4396 



ESTs 
ESTs 
ESTs 

ESTs, Weakly similar to 2004399A chromos 

gb:no97h03.s1 NCI_CGAP_Pr2 Homo sapiens 

ESTs 

ESTs 

ESTs 

Phase 2 & 3 Exons 
Phase 2 & 3 Exons 
Phase 2 & 3 Exons 

C22000007:gi|12314195|emb|CAB99338.1|(A 
NM_005080*:Homo sapiens X-box binding pr 
NM_012429*:Homo sapiens SEC 14 (S. cerevi 
NM_014509*:Homo sapiens kraken-like (BK1 
ENSP00000249072*:DJ222E13.1 (N-TERMINAL 
NM_014323*:Homo sapiens zinc finger prot 
splicing factor, arginine/serine-rich 5 
ESTs, Moderately similar to 138022 hypot 
ESTs 
ESTs 

signal transducer and activator of trans 

Homo sapiens mRNA expressed only in plac 

AFFX control: STAT1 

AFFX control: STAT1 

fumarylacetoacetate 

interferon stimulated protein; 15 kDa 

ESTs 

ESTs; Weakly similar to III! ALU SUBFAMILY J 
ESTs 

hepatocellular carcinoma associated protein; 
ESTs 

inhibin; beta B (activin AB beta polypeptide) 
ESTs 

HUM5G1 1 A Human fetal brain (TFujiwara) Homo 
ESTs 

CH22_FGENES.678_5 

CH22_FGENES.619_7 

CH22_FGENES.619_12 

CH22_EM:AC005500.GENSCAN.127 9 

CH22_EM:AC005500.GENSCAN.304 2 

CH22_FGENES.271_8 

CH22_FGENES.619_13 

CH22_FGENES.271_7 

CH22_FGENES.617_7 

CH.07_hsgi|6004473 

CH22_FGENES.264_1 

HER2 receptor tyrosine kinase (c erbB 2; ERBB2; 

CH22_FGENES.617_9 

CH.07_hsgi|5868264 

CH.19_hsgi|5867439 

CH22_FGENES.6 3 

CH.17_hsgi|5867230 

CH.20_hsgi|6552458 

CH22_EM:AC005500.GENSCAN.148 22 

CH22_FGENES.669_10 

KIAA1 028 protein 

CH22_FGENES.48_12 

CH22_FGENES.118_2 

ESTs 

multiple UniGene matches 

CH22_FGENES.619_8 

CH22_FGENES.137 

CH22_FGENES.360 3 
CH22_FGENES.706_9 
CH.21_hsgi|6531965 



4.2 

3.4 

3.1 

4.4 

3.9 

18.8 

3.3 

4.2 

4.4 

5.7 

3.8 

4.3 

26.2 

3.9 

10.1 

20 

9 

4 

4.6 

3.2 

3.9 

4.1 

3.7 

3.2 

3 

3 

4.5 
6.7 
3.2 
3.3 
3 

4.2 
3 
3 
4 

3.2 

16.8 

12.9 

11.3 

9.2 

8.5 

8.4 

8 

7.3 
7.2 
7.1 
6.8 

neu) 6.6 
6.5 
5.8 
5.7 
5.3 
5.1 
5.1 
4.7 
4.6 
4.6 
4.5 
4.5 
4.4 
4.3 
4.3 
4.3 

Zinc Finger Protein Hzf4 

4.3 

4.3 

4.2 



4.3 
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15 



HG2614 HT2710 
X83535 

HG4716 HT5158 
HG4677HT5102 



P25 



f30 



p5 

Us 1 



40 



45 



AA714311 



HG2465 HT4871 



AA707750 



AA976074 



CH.17_hsgi|5867215 
CH22_FGENES.669_8 

CH22_FGENES.48_18 

matrix metalloproteinase 14 (membrane inserted) 
CH22_FGENES.271_6 
CH22_FGENES.617_3 
CH22_FGENES.290_8 

CH22_FGENES.135 
CH22_FGENES.132 
CH.14_hsgi|6682474 
CH.02_hsgi|5867750 
CH22_FGENES.617_8 

CH22_DJ32I10.GENSCAN.23 39 
CH22_FGENES.543_20 
CH22_EM:AC005500.GENSCAN.96 1 
CH22_FGENES.204_2 
CH22_FGENES.619_4 
CH.16_hsgi|5867087 
EST cluster (not in UniGene) 
CH22_EM:AC005500.GENSCAN.149 9 
CH22 EM:AC005500.GENSCAN.421 5 
CH22_FGENES.13 4 
CH.07_hsgi|6004478 
CH22_FGENES.360_1 

CH22_FGENES.6_2 

CH22_C20H12.GENSCAN.16 2 

CH22_C65E1.GENSCAN.81 

ESTs; Weakly similar to cis Golgi matrix 

CH22_FGENES.307_4 

CH22_EM:AC005500.GENSCAN.248 14 

CH.06_hsgi|5902482 

CH22_FGENES.669_5 

CH22_DJ32I10.GENSCAN.19 8 

CH22_FGENES.527_6 

CH22_FGENES.330_10 

CH22_FGENES.14 2 

ESTs 

CH22_FGENES.226 7 
CH22_FGENES.13 3 
CH22_EM:AC005500.GENSCAN.209 12 
CH22_FGENES.271_3 



4.1 
4.1 

Collagen, Type Viii, Alpha 1 4.1 

4.1 

4 

3.9 
3.9 
3.8 

Guanosine 5' Monophosphate Synthase 

3.8 

3.8 

3.8 

3.8 

3.7 

Oncogene Ret/Ptc2, Fusion Activated 

3.7 

3.7 

3.7 

3.5 

3.5 

3.5 

3.4 

3.4 

3.4 

3.3 

3.3 

3.3 

Dna Binding Protein Ap 2, Alt Splice 3 

3.3 

3.2 

3.2 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3 

3 

3 

3 

3 



3.8 



3.7 



3.3 
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TABLE 17 A 

Table 17A shows the accession numbers for those pkeys lacking unigenelD's for Table 17. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 



116845 393481J AA649530AA659316 H64973 
103207 30635.-4 X72790 

126257 182217.1 N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815 

102791 37186.1 AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI680833 

AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 AA463992 AW206802 AI970376 AI58371 8 AI672574 
N25695 AW665466 AI818326 AA126128 AI480345 AW013827 M248638 AI214968 AA204735 AA207155 AA206262 AA204833 
AW003247 AW496808 AI080480 AI631703 AI651023 AI867418 AW818140 AA502500 AI206199 AI671282 AI352545 BE501030 
AI652535 BE465762 M206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048 
AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 BE46661 1 AI206344 AA574397 
AA348354AI493192 

126872 142696.1 AW450979 M136653 AA136656 AW419381 M984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359 
BE01 1367 BE01 1368 BE01 1362 BE01 1215 BE01 1 365 BE01 1 363 
1746257.1 R82040 R70934 
176835.1 AA225084AA302713 

324239.1 AI311928 AA936030 T51931 M609816 M487195 AA664207 
genbank_F09609 F09609 
genbank_R08440 R08440 



112631 
120742 
106864 
109700 
111532 
113938 
113947 
124357 
108733 
112303 
322136 
322296 
321811 
314648 
322520 
322521 
322675 
323332 
316186 
322975 
324261 
323817 
301976 
324961 
303642 
303797 
319551 
311935 
319834 
319977 
314138 
313591 



genbank_W81598 W81598 

genbank_W84768 W84768 

genbank_N22401 N22401 

504187.1 AA121022M126422 

genbank_R54797 R54797 

46802.1 AF075083 H52291 H52528 

47334.1 W76326AF086341 W72300 

1527481.1 D80630 D80896 D80895 

293660.1 AW979268 M878419 AA431342 AA431628 

38916.1 T55958T57205AF147346 

3891 7.1 AF1 47347 T55426 T55503 

86787.1 AA017656 M017374M019761 

179142.1 AI829520 AI791832 AA228414 AI791823 AA22921 1 AA229315 
425440.1 AI433540AA728984AA804981 
1510563.1 C16391 C16413 

273265.1 BE069341 AW748403 AL044891 AI908240 AA393080 

233566.1 AA410943 AW948953 AA334202 AA332882 

128835.1 T97905AA101672 

376239.1 AA613792 AW182329 T05304 AW858385 

284260.1 AW299459 AA4171 12 

386364.1 AW629759 AW749955 M633408 AI651005 

357371.1 AA761668AA573621 R92814 R09670 

174129.1 AA216387T63548AA228676 

112523.1 AA071267T65940 T64515AA071334 

345248.1 AA534222AA632632 T81234 

179960.1 AA740616AA654854AA229923 

103087.1 M046309AI263500AA046397 
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308106 AI476803 

338255 CH22_6856FG_LINK_EM:AC00 

335809 CH22_3181FG_617_6_LINK_EM 

335824 CH22_3197FG_619_11_LINK_E 

5 307010 AI140014 

307041 AI144243 

305913 AA876109 

305917 AA876469 

309574 AW1 68083 

10 325372 c12_hs 

325544 c12 hs 

332798 CH22_14FG_6_5_LINK_C4G1.G 

334223 CH22_1507FG_360_4_LINK_EM 

327075 c21_hs 

15 334447 CH22_1746FG_387_7_LINK_EM 

304782 AA582081 

313434 441798J W92070 AW019952 W92053 



111 



w 
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TABLE 17B 



Table 17B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 17. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

Imposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref 



Strand 



Nt_position 



334447 
335809 
335824 
332798 
334223 
338255 
325372 
325544 
327075 



Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
5866920 
6682452 
6531965 



Plus 

Plus 

Plus 

Minus 

Minus 

Minus 

Minus 

Plus 

Plus 



14308764-14308824 

26310772-26310909 

26376860-26376942 

232147-231974 

12734365-12734269 

15242294-15242231 

1117061-1117304 

171228-171286 

40413184041431 
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TABLE 18: Table 2 from BRCA 014 P 

Table 18 shows genes with atleast five times the expression in breast tumor tissue than is 
expressed in normal body tissues. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnkjenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 



Pkey ExAccn UnigenelD Unigene Title 



R1 



101378 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 5.3 

101530 M29874 Hs.1360 cytochrome P450, subfamily IIB (phenobar 9 

101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 12 

101878 M97815 Hs.183650 cellular retinoic acid-binding protein 2 6.5 

103010 X52509 Hs.161640 tyrosine aminotransferase 12.4 

104115 AF183810 Hs.26102 opposite strand to trichorhinophaiangeal 7.6 

104825 AA035613 Hs.141883 ESTs 6.9 

107105 AW963419 Hs.155223 stanniocalcin 2 5.3 

108819 AA011449 Hs.271627 ESTs 6.1 

112287 AB033064 Hs.334806 KIAA1 238 protein 7.3 

112561 AI791493 Hs.1 29873 ESTs, Weakly similar to A36036 cytochrom 8.2 

112637 R82331 Hs.1 64599 ESTs 5.4 

113206 BE262470 Hs.241471 RNB6 6.2 

113970 W27249 Hs.8109 hypothetical protein FLJ21080 6.9 

114965 AI733881 Hs.72472 BMP-R1B 10.1 

118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8JHUMAN A 19.7 

119905 AW449064 Hs.1 19571 collagen, type III, alpha 1 (Ehlers-Danl 8.4 

121611 M31669 Hs.1735 inhibin, beta B (activin AB beta polypep 5.6 

129301 AF182277 Hs.330780 cytochrome P450, subfamily IIB (phenobar 6.2 

133976 AI908165 Hs.1 69946 GATA-binding protein 3 (T-cell receptor 6.2 

134731 D89377 Hs.89404 msh (Drosophila) homeo box homolog 2 5.8 

300254 AW1 83618 Hs.55610 solute carrier family 30 (zinc transport 9.9 

301884 AA312082 Hs.105445 GDNF family receptor alpha 1 5.7 

302001 AB020711 Hs.278346 KIAA0904 protein 7.7 

302067 BE542706 Hs.222399 CEGP1 protein 7.3 

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 5.4 

302290 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 34.1 

302372 AL1 17406 Hs.200102 ATP-binding cassette transporter MRP8 6.7 

302385 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 13.8 

309177 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 17.3 

309583 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 57.6 

310781 AI380797 Hs.1 58992 ESTs 10.2 

311166 AI821005 Hs.118599 ESTs 10.8 

311935 AA216387 gb:nc16b02.s1 NCI_CGAP_Pr1 Homo sapiens 5.2 

312153 BE261944 Hs.1 18625 hexokinasel 5.2 

313328 AW449211 Hs.105445 GDNF family receptor alpha 1 12.4 

313915 C18863 Hs.163443 Homo sapiens cDNA FU1 1576 fis, clone HE 26.3 

314097 AA648744 Hs.269493 ESTs 6.6 

314138 AA740616 gb:ny97f11.s1 NCI_CGAP_GCB1 Homo sapiens 5.9 

314506 AA833655 Hs.206868 Homo sapiens cDNAFLJ 14056 fis, clone HE 8.5 

314558 AI873274 Hs.190721 ESTs 27.4 

314691 AW207206 Hs.136319 ESTs 20.7 

315006 AI538613 Hs.298241 Transmembrane protease, serine 3 10.9 

315021 AA533447 Hs.312989 ESTs 5.3 

315051 AW292425 Hs.163484 ESTs 12.9 

315060 AA551104 Hs.189048 ESTs, Moderately similar to ALUC_HUMAN ! 5.8 
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315196 AI367347 Hs.44898 Homo sapiens clone TCCCTA001 51 mRNAsequ 8.2 



315530 AW015415 Hs.127780 ESTs 8.9 

315634 AA837085 Hs.220585 ESTs 6.3 

316012 M764950 Hs.1 19898 ESTs 7 

5 316177 AI904982 Hs.293102 ESTs, Moderately similar to ALU1JHUMAN A 30.7 

316580 AA938198 Hs.146123 poly(A) polymerase gamma 9.4 

317803 AW664964 Hs.128899 ESTs 6.1 

317881 AI827248 Hs.224398 Homo sapiens cDNAFLJ11469fis, clone HE 9.6 

318740 NM_002543Hs.77729 oxidised low density lipoprotein (lectin 7.3 

10 318744 AI793124 Hs.144479 ESTs 17.8 

320211 AL039402 Hs.125783 DEME-6 protein 9.2 

321107 AI732643 Hs.144151 ESTs 12.3 

321644 AW975944 Hs.237396 ESTs 11.7 

321978 N77342 Hs.21851 Homo sapiens cDNAFLJ12900fis, clone NT 5 

15 322035 AL137517 Hs.334473 hypothetical protein DKFZp56401278 19 

322766 AW068805 Hs.288467 Homo sapiens cDNA FLJ 12280 fis, clone MA 5.2 

322818 AW043782 Hs.293616 ESTs 7.6 

f* 322975 C16391 gb:C16391 Clontech human aorta polyAmRN 16.5 

H 323262 AL133990 Hs.190642 ESTs 10.5 

S20 323332 AI829520 gb:wl19c06.x1 NCI_CGAP_Ut1 Homo sapiens 6.2 

W 3 23817 AA410943 BMP-R1B 8.4 

Lfl 324261 BE069341 gb:QV3-BT0381-270100-073-c08BT0381 Homo 49.4 

ff- 324432 AA464510 Hs.152812 ESTs 16.5 

jj! 324598 AW972227 Hs. 163986 Homo sapiens cDNA: FLJ22765 fis, clone K 5 

f25 324603 AW993522 Hs.292934 ESTs 10.4 

%| 324987 AI375572 Hs.172634 ESTs 18.8 

*»! 325544 Phase2&3Exons 5.7 

330388 Hs.46 HER2 receptor tyrosine kinase (c-erb-b2, 6.6 

s 334223 NM_005080*:Homo sapiens X-box binding pr 26.2 

fSO 335809 NM_014509*:Homo sapiens kra ken-like (BK1 10.1 

.j 335824 ENSP00000249072*:DJ222E13.1(N-TERMINAL 20 

^ AI052047 ESTs; Weakly similar to CYTOCHROME P450 6.7 

f'y R72427 CH22_EM:AC005500.GENSCAN.127 9 5.5 

r : CH22_FGENES.619_13 9.2 

33.5 CH22_FGENES.617_9 8 

CH22_FGENES.271_7 6.5 

f! I CH22_FGENES.619_7 7.3 

CH22_FGENES.271_8 12.9 

CH22_FGENES.619_12 8.4 

40 CH22_EM:AC005500.GENSCAN.304 2 11.3 

CH.07_hsgi|6004473 8.5 

CH22 FGENES.617_7 7.1 

CH22_FGENES.678_5 7.2 

CH22_FGENES.678_5 16.8 
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TABLE 18A 

Table 18A shows the accession numbers for those pkeys lacking unigenelD's for Table 18. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 



323332 179142J AI829520 AI791832M228414 AI791823 M229211 AA229315 

322975 1510563J C16391 C16413 

324261 273265J BE069341 AW748403 AL044891 AI908240 AA393080 

323817 233566J AA410943 AW948953 AA334202 AA332882 

311935 174129J AA216387 T63548 AA228676 

314138 179960J AA740616 AA654854 AA229923 

335809 CH22_3181FG_617_6_LINK_EM 

335824 CH22_3197FG_619_11_L1NK_E 

325544 c12_hs 

334223 CH22_1507FG_360_4JJNK_EM 
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TABLE 18B 

Table 18B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 18. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



Pkey: Unique number corresponding to an Eos probeset 

Reft Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1 999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref 



Strand NLposition 



335809 
335824 
334223 



Dunham, I. etal. 
Dunham, I. etal. 
Dunham, I. etal. 



325544 6682452 



Plus 
Plus 
Minus 
Plus 



26310772-26310909 
26376860-26376942 
12734365-12734269 
171228-171286 
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TABLE 19: 1045 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES 

Table 19 shows 1045 genes up-regulated in breast cancer compared to normal adult tissues. 
These were selected from 59680 probesets on the Affymetrix/Eos-Hu03 GeneChip array such 
that the ratio of "average" breast cancer to "average" normal adult tissues was greater than or 
equal to 2.5. The "average" breast cancer level was set to the 90 th percentile value. The 
"average" normal adult tissue level was set to the 90 th percentile value amongst 144 non- 
malignant tissues. In order to remove gene-specific background levels of non-specific 
hybridization, the 15 th percentile value amongst the 144 non-malignant tissues was subtracted 
from both the numerator and the denominator before the ratio was evaluated. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal body tissue 



Pkey ExAccn UnigenelD UnigeneTitle R1 

408591 AF015224 Hs.46452 mammaglobin 1 137.6 

406964 M21305 gb:Human alpha satellite and satellite 3 71.0 

400291 AA401369 Hs.190721 ESTs 68.4 
407277 AW1 70035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 54.2 
449746 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 46.4 
426878 BE069341 gb:QV3-BT0381-270100-073-c08BT0381 Homo 44.8 

400292 AA250737 Hs.72472 BMP-R1B 37.4 
427585 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 32.9 
408045 AW138959 Hs.245123 ESTs 31.9 
407178 AA195651 Hs.104106 ESTs 30.4 
407377 C16391 gb:C16391 Clontech human aorta polyA mRN 27.7 
450705 U90304 Hs.25351 iroquois homeobox protein 5 24.8 
407212 AA412108 Hs.269350 ESTs 22.0 
428846 NM_000230 Hs.194236 leptin (murine obesity homolog) 21.9 
404561 trichorhinophalangeal syndrome I (TRPS1 ) 21.8 
407980 AA046309 gb:zf12f01.s1 Soares_fetal_heart_NbHH19W 19.8 
447350 AI375572 Hs.172634 ESTs 17.3 
450375 AA009647 Hs.8850 a disintegrin and metalloproteinase doma 16.6 
422109 S73265 Hs.1473 gastrin-releasing peptide 16.5 
435496 AW840171 Hs.265398 ESTs, Weakly similar to transformation-r 16.0 
453160 AI263307 Hs.239884 H2B histone family, member L 15.8 
420813 X51501 Hs.99949 prolactin-induced protein 15.8 
415989 AI267700 Hs.317584 ESTs 15.5 
422505 AL120862 Hs.124165 programmed cell death 9 (PDCD9) 14.8 
424399 AI905687 Hs.2533 aldehyde dehydrogenase 9 family, member 14.5 
423575 C18863 Hs.163443 Homo sapiens cDNA FLJ 11 576 fis, clone HE 13.7 
429441 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 13.6 
431474 AL133990 Hs.190642 ESTs 13.5 
448595 AB014544 Hs.21572 KIAA0644 gene product 13.0 
427217 AA399272 Hs.144341 ESTs 12.8 
402578 C1001134:gi|2117372lpir||l65981 fatty ac 12.6 
422805 AA436989 Hs.121017 H2A histone family, member A 12.2 
424634 NM_003613 Hs.151407 cartilage intermediate layer protein, nu 12.0 
456207 AA193450 gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi 11.9 
424086 AI351010 Hs.102267 lysyl oxidase 11.9 
459587 AA031 956 gb:zk15e04.s1 Soares_pregnant_uterus_NbH 11.5 
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416208 AW291 168 Hs.41295 ESTs, Weakly similar to MUC2_HUMAN MUCIN 11.5 

429170 NM_001394 Hs.2359 dual specificity phosphatase 4 115 

407276 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 11.4 

434377 AW137148 Hs.306593 Homo sapiens cDNA FLJ11382fis, clone HE 11.3 

5 448390 AL035414 Hs.21068 hypothetical protein 11.1 

452401 NM_007115 Hs.29352 tumor necrosis factor, alpha-induced pro 11.0 

421037 AI684808 Hs.197653 programmed cell death 9 (PDCD9) 10.9 

452461 N78223 Hs.108106 transcription factor 10.7 

443348 AW873596 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 10.6 

10 421155 H87879 Hs.102267 lysyl oxidase 10.5 

402606 NM_024626:Homo sapiens hypothetical prot 10.4 

409269 AA576953 Hs.22972 hypothetical protein FLJ 13352 10.4 

447268 AI370413 Hs.36563 hypothetical protein FU2241 8 10.3 

447033 AI357412 Hs.157601 ESTs 10.2 

15 400295 W72838 Hs.2533 aldehyde dehydrogenase 9 family, member 10.1 

424905 NM_002497 Hs.153704 NIMA (never in mitosis gene a)-related k 10.1 

432441 AW292425 Hs.163484 ESTs 9.9 

f" 427365 AI873274 Hs.190721 ESTs 9.9 

p 438950 H23789 Hs.144530 EST 9.8 

i-20 422835 BE218705 Hs.121378 metallothionein-like 5, testis-specific 9.7 

rr! 425692 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 9.7 

If! 411869 W20027 Hs.23439 ESTs 9.6 

ft\ 439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 9.6 

m- 445730 AI624342 Hs.170042 ESTs 9.5 

~ z %5 459583 AI907673 gb:IL-BT152-080399-O04 BT152 Homo sapien 9.3 

T %! 452744 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKF2p434E082 (fr 9.2 

432596 AJ224741 Hs.278461 matrilin 3 9.1 

400297 AI127076 Hs.334473 hypothetical protein DKFZp56401278 9.1 

9 449448 D60730 Hs.57471 ESTs 9.1 

C30 423945 AA410943 gb:zt32h03.ii Soares ovary tumor NbHOTH 9.1 

y. 406348 Target Exon 9.0 

I/. 424735 U31875 Hs.272499 short-chain alcohol dehydrogenase family 9.0 

W 453392 U23752 Hs.32964 SRY (sex determining region Y)-box 11 9.0 

M 433365 AF026944 Hs.293797 ESTs 8.8 

M5 405654 NA C12001521 :gi|7513934|pir||T31081 cca3 pr 8.8 

b : ! 418601 AA279490 Hs.86368 calmegin 8.8 

W 451110 AI955040 Hs.265398 ESTs, Weakly similar to transformation-r 8.7 

408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 8.5 

409041 AB033025 Hs.50081 KIAA1 1 99 protein 8.4 

40 442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 8.3 

451561 N52812 Hs.177403 ESTs 8.2 

424001 W67883 Hs.1 37476 paternally expressed 10 8.2 

429859 NM_007050 Hs.225952 protein tyrosine phosphatase, receptor t 8.1 

423887 AL080207 Hs.134585 DKFZP434G232 protein 8.1 

45 405095 NA Target Exon 8.1 

419296 AA236115 Hs.120785 ESTs 8.0 

447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA, partial sequence 8.0 

413472 BE242870 Hs.75379 solute carrier family 1 (glial high affi 8.0 

416747 AW876523 Hs.15929 hypothetical protein FLJ12910 8.0 

50 415385 R17798 Hs.7535 COBW-like protein 7.9 

434424 AI811202 Hs.325335 Homo sapiens cDNA: FLJ23523 fis, clone L 7.9 

420931 AF044197 Hs.100431 small inducible cytokine B subfamily (Cy 7.9 

406687 M31126 Hs.272620 matrix metalloproteinase 11 (MMP11; stro 7.8 

400285 NA Eos Control 7.7 

5 5 437207 T27503 Hs.15929 hypothetical protein FLJ12910 7.6 

427119 AW880562 Hs.1 14574 ESTs 7.5 

429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260AB cell 7.5 

433426 H69125 Hs.1 33525 ESTs 7.5 

411078 AI222020 Hs.182364 CocoaCrisp 7.4 

60 426214 H59846 Hs.128355 ESTs, Moderately similar to ALU7_HUMAN A 7.4 

447475 AI380797 Hs.1 58992 ESTs 7.3 

415263 AA948033 Hs.130853 ESTs 7.2 

439569 AW602166 Hs.222399 CEGP1 protein 7.2 

414142 AW368397 Hs.150042 Homo sapiens cDNA FLJ14438 fis, clone HE 7.1 

65 426261 AW242243 Hs.168670 peroxisomal famesylated protein 7.0 

444783 AK001468 Hs.62180 anillin (Drosophila Scraps homolog), act 6.9 

445885 AI734009 Hs.1 27699 KIAA1 603 protein 6.9 
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429432 AI678059 Hs.202676 synaptonemal complex protein 2 6.9 

410781 AI375672 Hs.165028 ESTs 6.9 

443788 AI732643 Hs. 1441 51 ESTs 6.9 

421373 AA808229 Hs.167771 ESTs 6.8 

451398 AI793124 Hs.144479 ESTs 6.8 

404253 NM_021058*:Homo sapiens H2B histone fami 6.8 

441098 AI015591 Hs.131004 ESTs, Weakly similar to T17227 hypotheti 6.7 

426215 AW963419 Hs.155223 stanniocalcin 2 6.6 

428227 M321649 Hs.2248 small inducible cytokine subfamily B (CX 6.6 

422956 BE545072 Hs.122579 hypothetical protein FLJ10461 6.6 

411111 AW818127 gb:CM1-ST0277-061299-059-b07ST0277Homo6.6 

434988 AI418055 Hs.161160 ESTs 6.6 

442580 AI733682 Hs.130239 ESTs 6.6 

449611 AI970394 Hs.197075 ESTs 6.6 

408000 L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 6.5 

420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 6.5 

431089 BE041395 Hs.283676 ESTs, Weakly similar to unknown protein 6.5 

400301 X03635 Hs.1657 estrogen receptor 1 6.5 

427356 AW023482 Hs.97849 ESTs 6.5 

425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 6.4 

441134 W29092 Hs.7678 cellular retinoic acid-binding protein 1 6.4 

424902 NM.003866 Hs.153687 inositol poIyphosphate-4-phosphatase, ty 6.4 

448693 AW004854 Hs.228320 hypothetical protein FU23537 6.4 

431448 AL137517 Hs.334473 hypothetical protein DKFZp56401278 6.2 

444342 NM_014398 Hs.10887 similar to lysosome-associated membrane 6.1 

422168 AA586894 Hs.1 12408 S100 calcium-binding protein A7 (psorias 6.1 

453331 AI240665 Hs.8895 ESTs 6.1 

418007 M13509 Hs.83169 matrix metalloproteinase 1 (MMP1; inters 6.0 

441233 AA972965 Hs.135568 ESTs 6.0 

418092 R45154 Hs.106604 ESTs 6.0 

430044 AA464510 Hs.152812 ESTs 5.9 

432837 AA310693 Hs.87329 HSPC072 protein 5.9 

433285 AW975944 Hs.237396 ESTs 5.9 

450701 H39960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.9 

425707 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 5.9 

410785 AW803341 gb:IL2-UM0079-090300-050-D03 UM0079 Homo 5.9 

425398 AL049689 Hs.156369 hypothetical protein similar to tenascin 5.9 

414812 X72755 Hs.77367 monokine induced by gamma interferon 5.8 

459371 R20991 gb:yg06h01.r1 Soares infant brain 1MB H 5.8 

411284 N28519 Hs.135191 ESTs, Weakly similar to unnamed protein 5.8 

453511 AL031224 Hs.33102 transcription factor AP-2 beta (activati 5.8 

451807 W52854 Hs.27099 hypothetical protein FLJ23293 similar to 5.7 

430510 AW162916 Hs.241576 hypothetical protein PR02577 5.7 

415539 AI733881 Hs.72472 BMP-R1B 5.6 

438199 AW016531 Hs.122147 ESTs 5.6 

417866 AW067903 Hs.82772 collagen, type XI, alpha 1 5.5 

430019 AA463893 Hs.220933 ESTs 5.5 

439809 R41396 Hs.101774 hypothetical protein FLJ23045 5.5 

423811 AW299598 Hs.50895 homeoboxC4 5.4 

434539 AW748078 Hs.214410 ESTs, Weakly similar to MUC2_HUMAN MUCIN 5.4 

439138 AI742605 Hs.193696 ESTs 5.4 

453931 AL1 21278 Hs.25144 ESTs 5.4 

444078 BE246919 Hs.10290 U5 snRNP-specific 40 kDa protein (hPrp8- 5.4 

447102 BE167434 Hs.98471 ESTs, Weakly similar to T1 871 2 hypotheti 5.4 

451621 AI879148 Hs.26770 fatty acid binding protein 7, brain 5.4 

425236 AW067800 Hs.155223 stanniocalcin 2 5.3 

421464 AA291553 Hs.190086 ESTs 5.3 

450736 AW970060 gb:EST382140 MAGE resequences, MAGK Homo5.3 

428085 AA421081 Hs.12388 ■ ESTs 5.3 

452838 U65011 Hs.30743 preferentially expressed antigen in mela 5.3 

445424 AB028945 Hs.12696 cortactin SH3 domain-binding protein 5.3 

456938 X52509 Hs.161640 tyrosine aminotransferase 5.3 

422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.2 

438167 R28363 Hs.24286 ESTs 5.2 

433330 AW207084 Hs.132816 hypothetical protein MGC14801 5.2 

449765 N92293 Hs.206832 ESTs, Moderately similar to ALU8JHUMAN A 5.2 

416276 U41060 Hs.79136 LIV-1 protein, estrogen regulated 5.2 
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400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 5.2 

418004 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 5.2 

439840 AW449211 Hs.105445 GDNF family receptor alpha 1 5.2 

428771 AB028992 Hs.193143 KIAA1 069 protein 5.2 
5 455047 AW852530 gb:PM1-CT0243-071099-001-g06 CT0243 Homo5.2 

419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 5.2 

453197 AI916269 Hs.109057 ESTs, Weakly similar to ALU5_HUMAN ALUS 5.1 

400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 5.1 

431023 AI283133 Hs.297420 ESTs 5.1 

10 427666 AI791495 Hs.180142 calmodulin-like skin protein 5.1 

427718 AI798680 Hs.25933 ESTs 5.1 

434531 AA642007 Hs.1 16369 ESTs 5.1 

429220 AW207206 Hs.1 36319 ESTs 5.1 

405494 NA C2001 837*:gi|1 2697903|dbj|BAB21 770.1 1 (A 5.1 

15 452930 AW195285 Hs.194097 ESTs, Weakly similar to I38022 hypotheti 5.1 

444910 AI201849 gb:qs76g04.x1 NCI_CGAP_Pr28 Homo sapiens 5.1 

453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.0 

M' 444381 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 5.0 

r=: 450603 R43646 Hs.1 2422 ESTs 5.0 

20 416575 W02414 Hs.38383 ESTs 5.0 

^ : 438504 AW665281 Hs.224625 ESTs 5.0 

If! 416209 AA236776 Hs.79078 MAD2 (mitotic arrest deficient, yeast, h 5.0 

ff-. mm AK000713 Hs.193736 hypothetical protein FLJ20706 5.0 

T\ 420077 AW512260 Hs.87767 ESTs 4.9 

1 125 450480 X82125 Hs.25040 zinc finger protein 239 4.9 

\\ 437637 AJ003029 Hs.65792 syntrophin, gamma 2 4.9 

r£' s 431808 M30703 Hs.270833 amphiregulin (schwannoma-derived growth 4.9 

fe= ; 418836 AI655499 Hs.161712 ESTs 4.8 

i 442441 AI820662 Hs.129598 ESTs 4.8 

r30 435635 AF220050 Hs.181385 uncharacterized hematopoietic stem/proge 4.8 

f!' 400286 NA C1 6000922:gi|7499103|pir]|T20903 hypothe 4.8 

r" 407506 U71600 gb:Human zinc finger protein zfp31 (zf31 4.8 

Hi 420026 AI831190 Hs.166676 ESTs 4.8 

441377 BE218239 Hs.202656 ESTs 4.8 

!35 457726 AI217477 Hs.194591 ESTs 4.8 

Ql 412785 AW997556 Hs.78521 KIAA1717 protein 4.8 

h I 428368 BE440042 Hs.83326 matrix metalloproteinase 3 (stromelysin 4.7 

436026 AI349764 Hs.217081 ESTs 4.7 

409110 AA191493 Hs.48778 niban protein 4.7 

40 400284 NA estrogen receptor 1 4.7 

410102 AW248508 Hs.279727 Homo sapiens cDNA FLJ 14035 fis, clone HE 4.7 

407819 R42185 Hs.274803 ESTs 4.7 

430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.7 

422896 AW961489 Hs.154116 ESTs 4.7 

45 453616 NM_003462 Hs.33846 dynein, axonemal, light intermediate pol 4.7 

427427 AF077345 Hs.177936 ESTs 4.6 

421751 AW813731 Hs.159153 ESTs, Moderately similar to S65657 alpha 4.6 

454074 R63503 Hs.28419 ESTs 4.6 

405718 C4000799*:gi|6330365|dbj|BAA86508.1|(AB 4.6 

50 444649 AW207523 Hs.197628 ESTs 4.6 

429431 Z40313 Hs.106330 Homo sapiens clone IMAGE:23371, mRNA seq 4.6 

427811 M81057 Hs. 1 80884 carboxypeptidase B1 (tissue) 4.6 

447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKENcDNA 2010 4.6 

430345 AK000282 Hs.239681 hypothetical protein FU20275 4.6 
5 5 454307 AW855717 gb:RC1-CT0279-081299-013-b01 CT0279 Homo 4.6 

400303 AA242758 Hs.79136 LIV-1 protein, estrogen regulated 4.6 

438180 AA808189 Hs.272151 ESTs 4.6 
451 340 AW936273 gb:QVO-DT0020-090200-1 07-g07 DT0020 Homo 4.6 

458711 AL036877 Hs.282878 ESTs 4.6 

60 457430 AA514660 Hs.128443 ESTs 4.6 

416030 H15261 Hs.21948 ESTs 4.6 

447233 AW246333 Hs.17901 Homo sapiens, clone IMAGE:3937015, mRNA, 4.6 

445537 AJ245671 Hs.12844 EGF-like-domain, multiple 6 (EGFL6) 4.5 

424590 AW966399 Hs.46821 hypothetical protein FLJ20086 4.5 

65 432374 W68815 Hs.301885 Homo sapiens cDNAFLH 1346 fis, clone PL 4.5 

423833 AW503329 gb:UI-HF-BN0-akx-e-02-0-Ul.r1 NIH_MGC_50 4.5 

406747 AI925153 Hs.217493 annexinA2 4.5 
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412102 H56435 gb:yq98e09.r1 Soares fetal liver spleen 4.5 

431716 D89053 Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 4.5 
411050 AW814902 gb:MR1-ST0206-120400-022-f08ST0206 Homo 4.5 

401418 NA C14000338*:gi|7459502|pir||S74665 outer 4.5 

5 436194 AK001074 Hs.333435 Homo sapiens cDNA FLJ10212 fis, clone HE 4.5 

436211 AK001581 Hs.334828 hypothetical protein FLJ10719; KIAA1794 4.4 

414080 AA1 35257 Hs.47783 B aggressive lymphoma gene 4.4 

424115 AA335497 Hs.293965 ESTs, Weakly similar to I38022 hypotheti 4.4 

415786 AW419196 Hs.257924 hypothetical protein FLJ13782 4.4 

10 442117 AW664964 Hs.128899 ESTs 4.4 

452784 BE463857 Hs.151258 hypothetical protein FLJ21 062 4.4 

432731 R31178 Hs.287820 fibronectinl 4.4 
410534 AW905138 gb:QVO-NN1071-280400-207-g07 NN1071 Homo4.4 

405196 NA C2000662*:gi|7512792|pir||T12482hypothe 4.4 

15 430217 N47863 Hs.336901 ribosomal protein S24 4.4 

401793 C17001545:gi|5360127|gb|AAD42882.1lAF155 4.4 

415747 AA381209 gb:EST94257 Activated T-cells I Homo sap 4.4 

423679 AB007975 Hs.131454 KIAA0506 protein 4.4 

p 400238 NA C19000274*:gi|12741327|ref|XP_008833.2| 4.4 

f§0 425627 AF019612 Hs.297007 membrane-bound transcription factor prot 4.4 

fZ 400608 C10001899:gi|7508633|pir||T25392hypothe 4.4 

U! 458634 AV657310 Hs.282898 ESTs 4.3 

ff! 407771 AL138272 Hs.62713 ESTs 4.3 

ct, 405906 NA Target Exon 4.3 

l 't5 405925 NA Target Exon 4.3 

"S! 439382 BE247684 Hs.103070 ESTs 4.3 

fl 445263 H57646 Hs.42586 KIAA1560 protein 4.3 

407162 N63855 Hs.142634 zinc finger protein 4.3 

5 419536 AA603305 gb:np12d11.s1 NCI_CGAP_Pr3 Homo sapiens 4.3 

OO 454359 N71277 gb:za36e03.s1 Soares fetal liver spleen 4.3 

Lj. 411558 AA102670 Hs.70725 gamma-aminobutyric acid (GABA) A recepto 4.2 

!T. 450715 AI266484 Hs.31570 ESTs, Weakly similar to KIM1 324 protein 4.2 

421451 AA291377 Hs.50831 ESTs 4.2 

j;: 452864 AA033714 Hs.287629 hypothetical protein FU14260 4.2 

M5 409757 NM_001898 Hs.123114 cystatin SN 4.2 

413043 BE158766 gb:IL2-HT0397-071299-024-F02 HT0397 Homo 4.2 
W 413499 BE144884 gb:CM0-HT0182-041099-065-e11 HT0182 Homo4.2 

444619 BE538082 Hs.8172 ESTs, Moderately similar to A46010 X-lin 4.2 

408380 AF1 23050 Hs.44532 diubiquitin 4.2 

40 406992 S82472 gb:beta -pol=DNA polymerase beta {exon a 4.2 

404285 NA C6001 909:gi|704441 |dbj|BAA1 8909. 1 1 (D298 4.2 

425247 NM_005940 Hs.1 55324 matrix metalloproteinase 11 (MMP11;stro 4.2 

428046 AW812795 Hs.155381 ESTs, Moderately similar to I38022 hypot 4.2 

446163 AA026880 Hs.25252 prolactin receptor 4.2 

45 421147 AW592167 Hs.293299 ESTs 4.2 

426451 AI908165 Hs.1 69946 GATA-binding protein 3 (T-cell receptor 4.2 

415227 AW821113 Hs.72402 ESTs 4.2 

452176 AA024538 Hs.282990 Human DNA sequence from clone RP1-28H20 4.2 

452862 AW378065 Hs.8687 ESTs 4.2 

50 443646 AI085198 Hs.164226 ESTs 4.2 

425523 AB007948 Hs.158244 KIAA0479 protein 4.1 

424687 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 4.1 

430009 AA894564 Hs.22242 ESTs 4.1 

434469 AA634806 gb:ab28c02.r1 Stratagene lung (937210) H 4.1 

55 451381 BE241831 Hs.172330 hypothetical protein MGC2705 4.1 

450229 R18717 Hs.8929 hypothetical protein FU1 1362 4.1 
455700 BE0681 15 gb:CM1-BT0368-061299-060-g07 BT0368 Homo 4.1 

431924 AK000850 Hs.272203 Homo sapiens cDNA FLJ20843 fis, clone AD 4.1 

438885 AI886558 Hs.184987 ESTs 4.1 

60 401451 NM_004496*:Homo sapiens hepatocyte nucle 4.1 

431676 AI685464 gb:tt88f04.x1 NCI_CGAP_Pr28 Homo sapiens 4.1 

409092 AI735283 Hs.172608 ESTs 4.1 

429270 W60379 Hs.57773 ESTs 4.1 

443903 AI220547 Hs.135223 ESTs 4.1 

65 427122 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 4.1 

410275 U85658 Hs.61796 transcription factor AP-2 gamma (activat 4.1 

432912 BE007371 Hs.200313 ESTs 4.1 
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403585 Target Exon 4.1 

438295 AI394151 Hs.37932 ESTs 4.1 

420380 M640891 Hs.1 02406 ESTs 4.1 

431118 BE264901 Hs.250502 carbonic anhydrase VIII 4.1 

5 416182 NM_004354 Hs.79069 cyclin G2 4.1 

418994 AA296520 Hs.89546 selectin E (endothelial adhesion molecul 4.1 

400555 Target Exon 4.1 

410079 U94362 Hs.58589 glycogenin2 4.0 

427674 NM_003528 Hs.2178 H2B histone family, member Q 4.0 

10 427131 AA448460 Hs.1 12017 GE36gene 4.0 

439759 AL359055 Hs.67709 Homo sapiens mRNA full length insert cDN 4.0 

429353 AL1 17406 Hs.200102 ATP-binding cassette transporter MRP8 4.0 

421296 NM_002666 Hs.103253 perilipin 4.0 

418819 AA228776 Hs.191721 ESTs 4.0 

15 424188 AW954552 Hs.142634 zinc finger protein 4.0 



455431 AW938484 gb:CM0-DT0057-290200-253-d06 DT0057 Homo4.0 



404142 NA Target Exon 4.0 

M> 441143 AI027604 Hs.159650 ESTs 4.0 

js, 444540 AI693927 Hs.265165 ESTs 4.0 

!f0 415579 AA165232 Hs.222069 ESTs 4.0 

Q 452891 N75582 Hs.212875 ESTs, Weakly similar to DYH9_HUMAN CILIA 4.0 
Ifi 414605 BE390440 gb:601283601F1 NIH_MGC_44 Homo sapiens c 4.0 

!=: 452281 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 4.0 

417801 AA417383 Hs.82582 integrin, beta-like 1 (with EGF-like rep 4.0 

fp5 446232 AI281848 Hs.194691 retinoic acid induced 3 4.0 

\,\ 447377 X77343 Hs.334334 transcription factor AP-2 alpha (activat 4.0 

~i 437854 AL1 19723 gb:DKFZp761A2124_r1 761 (synonym: hamy2) 4.0 

iJ 446140 AA356170 Hs.26750 hypothetical protein FU21 908 4.0 

3 452240 AI591147 Hs.61232 ESTs 4.0 

»SO 459574 AI741122 Hs.101810 Homo sapiens cDNA FU 14232 fis, clone NT 4.0 

J= : 458673 N99626 gb:za39d11.rl Soares fetal liver spleen 4.0 

¥' 444858 AM 99738 Hs.208275 ESTs, Weakly similar to ALUA_HUMAN !!!! 4.0 

ffi 452166 AI948607 Hs.264680 ESTs 4.0 

'"Z-. 452681 AF153330 Hs.30246 solute carrier family 19 (thiamine trans 3.9 

"65 450192 AA263143 Hs.24596 RAD51 -interacting protein 3.9 

Q 406554 NA Target Exon 3.9 

Hi 416259 AA573006 Hs.19173 ESTs 3.9 

445813 Z42023 Hs.106576 alanine-glyoxylate aminotransferase 2-li 3.9 

451024 AA442176 gb:zw63b08.r1 Soares_totalJetus_Nb2HF8_ 3.9 

40 413930 M86153 Hs.75618 RAB11A, member RAS oncogene family 3.9 

401781 Target Exon 3.9 

415296 F05086 Hs.328142 ESTs 3.9 

452564 AA026777 gb:ze93c11.r1 Soares_fetal_heart_NbHH19W 3.9 

442500 AI819068 Hs.209122 ESTs 3.9 

45 419759 Z21336 Hs.135411 actjn related protein 3.9 

424638 AI472106 Hs.49303 Homo sapiens CDNAFLJ1 1663 fis, clone HE 3.9 

439699 AF086534 Hs.187561 ESTs, Moderately similar to ALU1_HUMAN A 3.9 

428042 AA419529 Hs.76391 myxovirus (influenza) resistance 1 , homo 3.9 

452501 AB037791 Hs.29716 hypothetical protein FU10980 3.9 

5 0 453049 BE537217 Hs.30343 ESTs 3.9 

443213 BE568414 Hs.145497 Homo sapiens cDNA: FLJ22097 fis, clone H 3.9 

443489 AI073512 Hs.133916 ESTs 3.9 
455092 BE152428 gb:CM0-HT0323-151299-126-b04 HT0323 Homo3.9 

401 785 NM_002275*:Homo sapiens keratin 15 (KRT1 3.9 

5 5 426427 M86699 Hs.1 69840 TTK protein kinase 3.9 

446009 AI989885 Hs.231926 ESTs 3.9 

436033 H75391 Hs.255748 ESTs 3.9 
451067 BE172186 gb:MRO-HT0559-110300-005-h11 HT0559 Homo3.8 

419348 AA236645 Hs.98274 ESTs 3.8 

60 444635 AI184268 Hs.339665 ESTs 3.8 

412140 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines 3.8 

403593 NA Target Exon 3.8 

442323 AW016669 Hs.29190 ESTs 3.8 

419854 AW664873 Hs.87836 Homo sapiens PAC clone RP5-1087M19 from 3.8 

65 433871 W02410 Hs.205555 ESTs 3.8 

445253 AI217928 Hs.144762 ESTs 3.8 

409542 AA503020 Hs.36563 hypothetical protein FU2241 8 3.8 
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443162 T49951 Hs.9029 DKFZP434G032 protein 3.8 

458194 AW383618 Hs.265459 ESTs, Moderately similar to ALU2JHUMAN A 3.8 

422475 AL359938 Hs.1 17313 Meis (mouse) homolog 3 3.8 

440705 AA904244 Hs.1 53205 ESTs 3.8 

5 447290 AI476732 Hs.263912 ESTs 3.8 

403426 Target Exon 3.8 

427821 AA470158 Hs.98202 ESTs 3.8 

454288 BE222648 Hs.279458 ESTs, Highly similar to C380A1 .1b [H.sap 3.8 

443801 AW206942 Hs.253594 ESTs 3.8 

10 410658 AW105231 Hs.192035 ESTs 3.8 

410672 AW794600 gb:RC6-UM0014-170300-022-C05 UM0014 Homo3.8 

428579 NM_005756 Hs.1 84942 G protein-coupled receptor 64 3.8 

445495 BE622641 Hs.38489 ESTs, Weakly similar to I38022 hypotheti 3.8 

447995 AI742618 Hs.181733 ESTs, Weakly similar to nitrilase homolo 3.7 

15 401747 Homo sapiens keratin 17 (KRT17) 3.7 

420633 NM_014581 Hs.274480 odorant-binding protein 2A 3.7 

423545 AP000692 Hs.129781 chromosome 21 open reading frame 5 3.7 

N 433138 AB029496 Hs.59729 semaphorin sem2 3.7 

pi 434715 BE005346 Hs.116410 ESTs 3.7 

30 428664 AK001666 Hs.189095 similar to SALL1 (sal (Drosophila)-like 3.7 

™' 450951 AA018534 Hs.103334 ESTs 3.7 



Lfl 402696 NA C3002523:gi|6686211|sp|Q27533|YH2M_CAEEL3.7 



fr; 446868 AV660737 Hs.1 351 00 ESTs 3.7 

p I 458154 AW816379 Hs.335018 ESTs 3.7 

115 422026 U80736 Hs.1 10826 trinucleotide repeat containing 9 3.7 

Kj 419440 AB020689 Hs.90419 KIAA0882 protein 3.7 

f=i 421524 AA312082 Hs.105445 GDNF family receptor alpha 1 3.7 

417283 N62840 Hs.48648 ESTs 3.7 

s 401508 NA NM_024817:Homo sapiens hypothetical prot 3.7 

f3.0 410303 AA324597 Hs.21851 Homo sapiens cDNA FLJ 12900 fis, clone NT 3.7 

™: 420362 U79734 Hs.97206 huntingtin interacting protein 1 3.7 

r!. 433384 AI021992 Hs.124244 ESTs 3.7 

FU 434302 AA629065 Hs.1 16301 ESTs 3.7 

P 1 443938 R55373 Hs.20864 ESTs 3.7 
35 448420 BE623004 gb:601 441 282F1 NIH_MGC_72 Homo sapiens c 3.7 

U 458712 AI347502 Hs.107872 hypothetical protein FLJ20761 3.7 

pi 433404 T32982 Hs.102720 ESTs 3.7 

405232 NM_015832:Homo sapiens methyl-CpG bindin 3.7 

430491 AL109791 Hs.241559 Homo sapiens mRNA full length insert cDN 3.7 
40 455609 BE011668 gb:CM3-BN0223-1 00500-1 77-a04 BN0223 Homo3.7 

450164 AI239923 Hs.30098 ESTs 3.7 

453948 AI970797 Hs.64859 ESTs 3.7 

436061 AI248584 Hs.190745 Homo sapiens cDNA: FLJ21 326 fis, clone C 3.7 

401049 NA Target Exon 3.6 

45 418867 D31771 Hs.89404 msh (Drosophila) homeo box homolog 2 3.6 

420179 N74530 Hs.21168 ESTs 3.6 

458663 AV658444 Hs.280776 tankyrase, TRF1 -interacting ankyrin-rela 3.6 

437259 AI377755 Hs.120695 ESTs 3.6 

428309 M97815 Hs.183650 cellular retinoic acid-binding protein 2 3.6 

50 450522 AI698839 gb:wd31f02.x1 Soares_NFL_T_GBC_S1 Homo s 3.6 

451952 AL120173 Hs.301663 ESTs 3.6 
412209 AW901456 gb:RC0-NN1012-270300-031-o97 NN1012 Homo3.6 

425201 AA352111 gb:EST60061 Activated T-ce!ls XX Homo sa 3.6 

443830 AI142095 Hs.143273 ESTs 3.6 
5 5 439255 BE1 64500 gb:RC4-HT0469-230300-014-e10 HT0469 Homo 3.6 

414869 AA157291 Hs.21479 ubinucleinl 3.6 

409064 AA062954 Hs.1 41 883 ESTs 3.6 

407721 Y12735 Hs.38018 dual-specificity tyrosine-(Y)-phosphoryl 3.6 

445135 AK000054 Hs.12347 hypothetical protein FLJ20047 3.6 

60 404091 NA Target Exon 3.6 

409731 AA1 25985 Hs.56145 thymosin, beta, identified in neuroblast 3.6 

405153 Target Exon 3.6 

423248 AA380177 Hs.125845 ribulose-5-phosphate-3-epimerase 3.6 

403639 NA ENSP00000233023 , :CDNAFLJ12662fis,clon 3.6 

65 404360 C7001385:gi|12082809|gb|AAG48618.1lAF315 3.6 

422352 AA766296 Hs.99200 ESTs 3.6 

423338 AB007961 Hs.127338 KIAA0492 protein 3.6 
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424202 BE350295 Hs.15032 RAN binding protein 17 3.6 

431750 AA514986 Hs.283705 ESTs 3.6 

439907 AA853978 Hs.1 24577 ESTs 3.6 

453596 AA441838 Hs.62905 hypothetical protein FLJ14834 3.6 

406446 NA Target Exon 3.6 

418454 AA315308 Hs.1 95870 hypothetical protein FU 14991 3.6 

434360 AW015415 Hs.1 27780 ESTs 3.6 

409079 W87707 Hs.82065 interieukin 6 signal transducer (gp130, 3.6 

440132 AI697121 Hs.202466 ESTs, Weakly similar to S65824 reverse t 3.6 

448706 AW291095 Hs.21814 interieukin 20 receptor, alpha 3.6 

440671 AW297920 Hs.1 30054 ESTs 3.5 
407647 AW8601 58 gb:RC0-CT0379-2901 00-032-b04 CT0379 Homo 3.5 

459023 AW968226 Hs.60798 ESTs 3.5 

402820 NM_017646*:Homo sapiens tRNA isopentenyl 3.5 

417009 AA191719 Hs.314714 ESTs 3.5 

447048 AW393080 Hs.228320 hypothetical protein FU23537 3.5 

449978 AI806335 Hs.200829 ESTs, Weakly similar to T301 71 ninein- 3.5 

428062 AA420683 Hs.98321 hypothetical protein FLJ14103 3.5 

452909 NM_015368 Hs.30985 pannexin 1 3.5 

400610 NA Target Exon 3.5 

417843 W07361 Hs.22545 Homo sapiens cDNAFLJ12935fis, clone NT 3.5 

419335 AW960146 Hs.284137 hypothetical protein FU 12888 3.5 

451592 AI805416 Hs.213897 ESTs 3.5 

443270 NM_004272 Hs.337737 Homer, neuronal immediate early gene, 1B 3.5 

423948 AW392342 Hs.283077 centrosomalP4.1-associated protein; unc 3.5 

449424 AW448937 Hs.197030 ESTs 3.5 
423841 AW753967 gb:RC2-CT0304-080100-011-h12CT0304Homo3.5 

416806 NM_000288 Hs.79993 peroxisomal biogenesis factor 7 3.5 

422060 R20893 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 3.5 

433104 AL043002 Hs.128246 ESTs, Moderately similar to unnamed prot 3.5 

415778 H84847 Hs.49391 hypothetical protein LOC54149 3.5 

413054 AW316843 Hs.66309 hypothetical protein MGC1 1061 3.5 

416636 N32536 Hs.42645 solute carrierfamily 16 (monocarboxylic 3.5 

424639 AI917494 Hs.9812 Homo sapiens cDNA FU14388 fis, clone HE 3.5 

424827 AI057094 Hs.96867 Homo sapiens cDNA: FLJ231 55 fis, clone L 3.5 

437782 AI370876 Hs.79090 exportin 1 (CRM1, yeast, homolog) 3.5 

411514 AW850178 gb:IL3-CT0219-271099-022-H12 CT0219 Homo 3.5 

413783 AA314337 Hs.301547 ribosomal protein S7 3.5 

421106 AA877124 Hs.172844 ESTs 3.5 

431291 N25521 Hs.25275 Kruppel-type zinc finger protein 3.5 

440623 AI935016 Hs.216639 ESTs 3.5 
455838 BE145808 gb:MRO-HT0208-101299-103-f11 HT0208 Homo 3.5 

458771 AW295151 Hs.163612 ESTs 3.5 

442942 AW1 67087 Hs.1 31 562 ESTs 3.5 

436550 Z501 58 Hs.270235 ESTs, Weakly similar to MMHUB1 laminin b 3.5 

418849 AW474547 Hs.53565 Homo sapiens PIG-M mRNA for mannosyltran 3.5 

424420 BE614743 Hs.146688 prostaglandin E synthase 3.5 

430916 AW505021 Hs.88414 BTB and CNC homology 1, basic leucine zi 3.5 

432030 AI908400 Hs.143789 ESTs 3.5 

439405 AF086224 Hs.55238 ESTs 3.5 

405917 NA C17000675:gi|7290703|gb|AAF46150.1|(AEO 3.5 

452727 AW993582 Hs. 176220 ESTs 3.5 

426320 W47595 Hs.169300 transforming growth factor, beta 2 3.4 

421070 AA283185 Hs.1 9327 ESTs 3.4 

424625 AW904466 Hs.321197 PDZ domain protein (Drosophila inaD-like 3.4 

428508 BE252383 Hs.184668 SBBI31 protein 3.4 
455651 BE064962 gb:RC1-BT0313-130400-016-c02 BT0313 Homo 3.4 

410555 U92649 Hs.64311 a disintegrin and metalloproteinase doma 3.4 

447754 AW073310 Hs.163533 Homo sapiens CDNAFLJ14142 fis, clone MA 3.4 
418636 AW749855 gb:QV4-BT0534-281299-O53-c05 BT0534 Homo 3.4 

404097 NA C5000242*:gi|9369379|gb|AAF87128.1|AC006 3.4 

434205 AF1 19861 Hs.283032 hypothetical protein PRO201 5 3.4 

421072 AI215069 Hs.89113 ESTs 3.4 

402421 NA C1001578*:gi|6759903|gb|AAF28099.1I(AF1 3.4 

405248 NA Target Exon 3.4 

407638 AJ404672 Hs.334483 hypothetical protein FLJ23571 3.4 

403000 BE247275 Hs.151787 U5 snRNP-specific protein, 116 kD 3.4 
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H J J J J J 


AF0381fi4 


Hs.98074 itchy (mouse homolog) E3 ubiquitin prate 


3.4 




432239 


X81334 


Hs.2936 matrix metalloproteinase 13 (collagenase 


3.4 




458747 




Hs.257391 hypothetical protein DKFZp761J1523 


3.4 




442082 


R41823 


Hs.7413 ESTs; calsyntenin-2 


3.4 


5 


417974 


AA? 1(1765 

Ant 1 u r UJ 


gb:zr90c06.r1 NCI_CGAP_GCB1 Homo sapiens 3.4 




446002 


AI346468 


H<? 145789 ESTs 


3.4 




448995 


AI613276 


1fifi9 nuaninp nucleotide hindino nrotein (G or 


3.4 




436007 


AI247716 


H"? 2371RR FST<! 


3.4 


10 


424698 


AA1 64366 


H<t 111Q73 hvnothetical nrotein FLJ23111 


3.4 




AIQ71313 


He 170904 KIAAIW1 nrnfein 






410467 


AF102546 


He fi3Q31 HaHiQhiinH ^nrriennhila^ hrtmnlnn 


3.3 






NA 


Tom CvAh 

1 diyct exun 


3 3 




441826 


AW503603 


He 19QQ11 nhnerthntriPQtprflee relateH 


3.3 


15 


453472 


ai nwn 


nh-nKF7n*iR4Mn37 rl 164 fqvnonvm' hfbr21 


3.3 


447(178 


AWSR1797 

rtVVOOJ/ Z/ 


He 3fl117ft F9Te 


3 3 




441690 


R81733 
r\o 1 / w 


He 331 fifi F^Te 
no, jo \\J\) co i o 


3.3 




43f)(lQ3 


A Aft 14043 


We flftn4^ PQTe 

ns.00U4O Co is 


3 3 




418478 


I I3AQ41 


We 1 174 rvr*lin_Honondont kinaep inhihitnr 9A (mo 
rib. 1 Ift l^Utll-UcpcMUcI IL MllaOC IIHIIUILUI CJ\ yMIC 


3 3 
j. <j 




4(189(18 




ns.tOUOtt Scnnc/inreonine Kinase to 


3 3 


|__.| 


*t IH/O/ 


Ml IvvOOU 


Hs.125087 ESTs 


3.3 




AF011171 


Hs.23838 calcium channel, voltage-dependent L ty 


3.3 


2 ¥: 
i n 5 ; 


41RQ19 
4 I09 1 1 


mm ooorai 


Hs.89472 angiotensin receptor 1 


3.3 




43fiAfl1 
*r004UO 


aairoo7Q 


Hs.172932 Homo sapiens mRNA for partial 3'UTR, seq 


3.3 


ii.. : 




riVVOUOOO/ 


Hs.4007 Sarcolemmal-associated protein 


3.3 


fi5 


4uyoo 1 


iNivi_uuoyoz 


Hs.54416 sine oculis homeobox (Drosophila) homolo 


3.3 




49Q14fl 


AW13ftft79 
AW 1 000/ Z 


Hs.135288 ESTs 


3.3 




4onfln7 

4ZU0U/ 


AA9POR97 


Hs.57846 ESTs 


3.3 


O 




AA9QRQR1 


gb:EST1 12514 Adrenal gland tumor Homo sa 


3.3 




44 tH QQ 

440 i oy 


AIQ3R410 

Miyoo4ou 


Hs.147482 ESTs 


3.3 






WA 


Target Exon 


3.3 


■ 


49RRA1 

MZUVO 1 


AAQQ4ftQfi 

r\rw3*tt)3U 


Hs.22514 ESTs 


3.3 


1=35: 


41*799 

t JO 1 


AA741141 


Hs.282832 ESTs, Weakly similar to T24961 hypotheti 


3.3 


~~ : 

i 


4I1Q430 


R91Q41 


Hs.166975 splicing factor, arginine/serine-rich 5 


3.3 




443194 




Hs.279009 matrix Gla protein 


3.3 


25 


441439 
*+tj*tjz 


AVR53771 


gb'AV653771 GLC Homo sapiens cDNA clone 


3.3 






AA191RRR 


Hs.10592 ESTs 


3.3 


1 U 


4f1R111 


MA 


Target Exon 


3.3 








Hs.293261 ESTs 


3.3 


40 


411171 




gb:QV2-ST0296-150200-040-c10 ST0296 Homo 3.3 


4004 11 


T1RQ71 


Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 


3.3 




43Q310, 




Hs.102793 ESTs 


3.3 




401171 
4v lO/O 


MA 


Target Exon 


3.3 






A I (145R33 


Hs.44269 ESTs 


3.3 


45 


441R9A 


AI1d41fifi 


Hs.155743 ESTs 


3.3 


44ft 94 3 


A.IA/3RQ77'! 


Hs.52620 integrin, beta 8 


3.3 




44A109 
440 lUz 




Hs.22270 ESTs 


3.3 




44Z1 10 


A A07C740 
AMy/0/ to 


Hs.202242 ESTs 


3.3 




410/00 


AAAJOlOO 


Hs.79889 monocyte to macrophage differentiation-a 


3.3 


50 


40ouoy 




Hs.130526 ESTs 


3.3 


4014/4 


I AJo/4 


Hs.207636 ESTs 


3.2 




449c; CQ 

44zooy 


1 IUZ 10 


Hs.159993 gycosyltransferase 


3.2 




4003*11 


AI0Z4UUa 


Hs.44577 ESTs 


3.2 




420036 


R60336 


Hs.52792 Homo sapiens mRNA; cDNA DKFZp586l1823 (f 3.2 


55 


435627 


W88774 


Hs.1 18370 ESTs 


3.2 


411598 


BE336654 


Hs.70937 H3 histone family, member A 


3.2 




446733 


AA863360 


Hs.26040 ESTs, Weakly similar to fatty acid omega 


3.2 




410153 


BE311926 


Hs.15830 hypothetical protein FLJ12691 


3.2 




403637 


NA 


C3001106*:gi|10047201|dbj|BAB13394.1|(A 


3.2 


60 


405547 




NM_018833*:Homo sapiens transporter 2, A 


3.2 


427878 


C05766 


Hs.181022 CGI-07 protein 


3.2 




451871 


AI821005 


Hs.1 18599 ESTs 


3.2 




410313 


R10305 


Hs.185683 ESTs 


3.2 




416856 


N27833 


Hs.269028 ESTs, Weakly similar to I38022 hypotheti 


3.2 


65 


449490 


A1652777 


Hs.197069 ESTs 


3.2 


450506 


NM 004460 


Hs.418 fibroblast activation protein, alpha 


3.2 




440684 


AI253123 


Hs.127356 ESTs, Highly similar to S21424 nestin [H 


3.2 




459055 


N23235 


Hs.30567 ESTs, Weakly similar to B34087 hypotheti 


3.2 
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452190 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 3.2 

430965 AA489732 Hs.154918 ESTs 3.2 

405394 Target Exon 3.2 

424693 BE169810 Hs.47557 ESTs 3.2 

5 454265 H03556 Hs.300949 ESTs, Weakly similar to thyroid hormone 3.2 

437687 AA765917 Hs. 122840 ESTs 3.2 

428372 AK000684 Hs. 183887 hypothetical protein FLJ22104 3.2 

414083 AL121282 Hs.257786 ESTs 3.2 
411670 AW856552 gb:RC1-CT0294-O80100-O12-a04CT0294 Homo 3.2 

10 416283 NM_005429 Hs.79141 vascular endothelial growth factor C 3.2 

437488 M758239 Hs. 180330 ESTs 3.2 

428398 AI249368 Hs.98558 ESTs 3.2 

452042 H38857 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HE 3.2 

421477 AI904743 Hs.104650 hypothetical protein FLJ10292 3.2 

=.1.5 438078 AI016377 Hs.131693 ESTs 3.2 

IZ' 448816 AB033052 Hs.22151 KIAA1 226 protein 3.2 

U 419519 AI198719 Hs.176376 ESTs 3.2 

fH 404580 NM_014112*:Homosapienstrichorhinophala 3.2 

f£ 447046 AA326187 Hs.17170 G protein-coupled receptor 4 3.2 

*20 457473 AW974903 Hs.291231 ESTs 3.1 

W 429838 AW904907 Hs.30732 hypothetical protein FLJ13409; KIAA1711 3.1 

f?'! 459702 AI204995 gb:an03c03.x1 Stratagene schizo brain S1 3.1 

t "i 400195 NA NM_007057*:Hcmo sapiens ZWIOinteractor 3.1 

417860 AW408557 Hs.235498 hypothetical protein FU 14075 3.1 

£25 417995 AW974175 Hs.188751 ESTs, Weakly similar to MAPBJHUMAN MICRO 3.1 

422589 AA312735 Hs.30512 Homo sapiens mRNA for KIAA0556 protein, 3.1 

L. 435870 AA701327 Hs.17949 ESTs 3.1 

W 440801 AA906366 Hs.190535 ESTs 3.1 

|b4> 426274 D38122 Hs.2007 tumor necrosis factor (ligand) superfami 3.1 

?30 423728 AW891294 Hs.132136 solute carrier family 4, sodium bicarbon 3.1 

- " : 439677 R82331 Hs.164599 ESTs 3.1 

452834 AI638627 Hs.105685 KIAA1 688 protein 3.1 

O 431349 AA503653 Hs.156942 ESTs, Moderately similar to ALU2JHUMAN A 3.1 

417576 AA339449 Hs.82285 phosphoribosylglycinamideformyltransfer 3.1 

^5 430264 AA470519 gb:nc71f10.s1 NCI_CGAP_Pr1 Homo sapiens 3.1 

418827 BE327311 Hs.47166 HT021 3.1 
41 0835 AW806906 gb:QV4-ST0023-1 60400-172-d12 ST0023 Homo 3.1 

426269 H15302 Hs.168950 Homo sapiens mRNA; cDNA DKFZp566A1046 (f 3.1 

405336 NA Target Exon 3.1 

40 437783 AI683150 Hs.201550 ESTs, Weakly similar to ALU1_HUMAN ALUS 3.1 

440931 AI583052 Hs.270058 ESTs 3.1 
455945 BE160636 gb:PM1-HT0422-291299-002-c08 HT0422 Homo 3.1 

430437 AI768801 Hs.169943 Homo sapiens cDNAFLJ13569 fis, clone PL 3.1 

405848 NA Target Exon 3.1 

45 455685 BE066976 gb:PM0-BT0340-211299-003-c12 BT0340 Homo 3.1 

406970 M29994 gb:Human alpha-l spectrin gene, exon 12. 3.1 

409602 W26713 Hs.256972 ESTs 3.1 

423518 D45027 Hs.129732 R3H domain (binds single-stranded nuclei 3.1 

425653 AI065104 Hs.249718 ESTs, Weakly similar to A46010 X-linked 3.1 

50 426326 BE165753 Hs.250528 Homo sapiens, clone IMAGE:4098694, mRNA, 3.1 

433805 AA706910 Hs.1 12742 ESTs 3.1 
437152 AL050027 gb:Homo sapiens mRNA; cDNA DKFZp566C03243.1 

448602 AI541305 Hs.48778 niban protein 3.1 

452844 AW407181 Hs.218377 Homo sapiens cDNAFLM 1927 fis, clone HE 3.1 

5 5 407366 AF026942 gb:Homo sapiens cig33 mRNA, partial sequ 3.1 

408254 AW807227 gb:MR4-ST0062-180200-001-e10 ST0062 Homo 3.1 

424085 NM_002914 Hs.139226 replication factor C (activator 1)2 (40 3.1 

416790 R83066 Hs.7043 succinate-CoA ligase, GDP-forming, alpha 3.1 

420020 BE295866 Hs.94382 adenosine kinase 3.1 

60 426119 W94997 Hs.189917 ESTs 3.1 

426968 U07616 Hs.173034 amphiphysin (Stiff-Mann syndrome with br 3.1 

457421 AL117431 Hs.112165 Homo sapiens CDNAFLJ12198 fis, clone MA 3.1 

453403 BE466639 Hs.61779 Homo sapiens cDNA FU13591 fis, clone PL 3.1 

454141 AW138413 Hs.139336 ATP-binding cassette, sub-family C (CFTR 3.1 

65 426650 AA382814 gb:EST96097 Testis I Homo sapiens cDNA 5 3.1 

450865 AI248013 Hs.106532 ESTs, Weakly similar to I38588 reverse t 3.1 

407993 AW1 35274 Hs.12433 ESTs 3.1 
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446466 

457888 

420058 

409248 

452747 

418926 

419346 

429826 

435147 

420139 

405609 

404274 

449777 

415459 

415245 

406291 

414210 

432055 

442246 

451353 

451177 

418026 

401326 

409920 

432887 

411789 

401045 

432162 

434627 

442611 

425477 

457183 

433014 

415542 

416173 

408155 

453913 

435495 

423629 

411836 

415030 

419606 

440310 

443608 

414140 

444781 

435393 

454071 

446922 

448062 

427711 

450382 

424866 

433043 

420802 

445625 

403677 

411093 

435255 

443127 

448104 

427315 

430414 

423600 

458562 

402109 

429629 



H38026 

BE219794 

AK001423 

AB033035 

BE153855 

AA232658 

AI830417 

N93266 

AL133731 

NM_005357 

NA 

AI971362 
H07118 
N59650 
NA 

BE383592 

AW972359 

AI791988 

N21043 

AI969716 

BE379727 

NA 

BE1 69746 
AI926047 
AF245505 

AA584062 

AI221894 

BE077155 

AW958879 

H91882 

NM_014711 

R13474 

R52782 

AB014528 

AW004683 

AI754212 

AW021173 

AW901879 

D31118 

AW294795 

AA878939 

AI375957 

AA281279 

NM 014400 

AA701259 

AI041793 

BE175605 

AW295923 

M31659 

AA397658 

W01938 

W57554 

U22376 

BE246743 

NA 

BE067650 

W87434 

BE568102 

AI674818 

AA1 79949 

AW365665 

AI633559 

N34128 

NA 

BE501732 



Hs.308 

Hs.293471 

Hs.94694 

Hs.51965 

Hs.61460 

Hs. 105794 

Hs.44143 

Hs.40747 

Hs.4774 

Hs.95351 



Hs.231945 

Hs.6099 

Hs.27252 



Hs.293334 
Hs.129115 
Hs.42932 
Hs.13034 
Hs.83213 

Hs.12504 

Hs.162859 

Hs.72157 

Hs.272798 

Hs.39311 

Hs.177537 

Hs.270535 

Hs.1 18569 

Hs.279912 

Hs.290263 

Hs.43133 

Hs.78934 

Hs.21951 

Hs.18612 

Hs.314453 

Hs.191735 

Hs.198529 

Hs.125406 

Hs.289074 

Hs.23317 

Hs.11950 

Hs.189299 

Hs.42502 

Hs.255472 
Hs.180408 
Hs.60257 
Hs.337243 
Hs.125019 
Hs.1334 
Hs.288529 



anrestin 3, retinal (X-arrestin) 3.1 

ESTs 3.1 

Homo sapiens cDNA FLJ10561 fls, clone NT 3.0 

KIAA1209 protein 3.0 

Ig superfamily receptor LNIR 3.0 

UDP-glucose:glycoprotein glucosyltransfe 3.0 

polybromo 1 3.0 

ESTs 3.0 
Homo sapiens mRNA; cDNA DKFZp761C1712 (f 3.0 

lipase, hormone-sensitive 3.0 

ENSP00000241065*:CDNA 3.0 

NM_002944*:Homo sapiens v-ros avian UR2 3.0 

ESTs 3.0 

ESTs 3.0 

ESTs 3.0 

Target Exon 3.0 
gb:601297871F1 NIH_MGC_19 Homo sapiens c 3.0 

ESTs 3.0 

ESTs 3.0 

ESTs 3.0 

ESTs 3.0 

fatty acid binding protein 4, adipocyte 3.0 
C10000447*:gi|1 168375lsp|P43467|AGA1_PED 3.0 



likely ortholog of mouse Arkadia 3.0 

ESTs 3.0 

DKFZP564I1 922 protein 3.0 

C11001883*:gi|6753278|ref|NP_033938.1| c 3.0 

hypothetical protein FU2041 3 3.0 

ESTs 3.0 

hypothetical protein DKFZp761B1514 3.0 

ESTs 3.0 

Dvl-binding protein IDAX (inhibition of 3.0 

KIAA0419 gene product 3.0 

ESTs, Weakly similar to I38022 hypotheti 3.0 

gb:yg99d09.r1 Soares infant brain 1 NIB H 3.0 

KIAA0628 gene product 3.0 

mutS (E. coli) homolog 2 (colon cancer, 3.0 

Homo sapiens Xq pseudoautosomal region; 3.0 

Homo sapiens cDNA: FU21909 fis, clone H 3.0 

ESTs 3.0 

hypothetical protein MGC1 0520 3.0 

ESTs 3.0 

ESTs 3.0 

F-box only protein 22 3.0 

hypothetical protein FLJ14681 3.0 

GPI-anchored metastasis-associated prote 3.0 

ESTs 3.0 

ESTs 3.0 



gb:RC5-HT0580-100500-022-H07 HT0580 Homo3.0 
KIAA1843 protein 3.0 
solute carrier family 25 (mitochondrial 3.0 
Homo sapiens cDNA FU 1 3598 fis, clone PL 3.0 
ESTs, Weakly similar to ALU7_HUMAN ALU S 2.9 
lymphoid nuclear protein (LAF-4) mRNA 2.9 
v-myb avian myeloblastosis viral oncogen 2.9 
hypothetical protein FU22635 2.9 
C4001462:gi|4887715|gb|AAA79329.2| (L088 2.9 
gb:MR4-BT0358-090300-003-e01 BT0358 Homo 2.9 
Hs.106015 ESTs, Moderately similar to ALU1_HUMAN A 2.9 
Hs. 180312 mitochondrial ribosomal protein S16 2.9 
Hs.316433 Homo sapiens cDNA FLJ1 1375 fis, clone HE 2.9 
Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 2.9 
Hs. 120388 ESTs 2.9 
Hs.310359 ESTs 2.9 
Hs.145268 ESTs 2.9 
Target Exon 2.9 
Hs.30622 Homo sapiens CDNAFLJ13010 fis, clone NT 2.9 
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4/o4/ y 


VOA070 
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4z4oUU 


ALUOOOOO 




X 1 1 AQG 


btU/UoUU 


50 


Xaaoca 

4uuzoy 


MA 

NA 


A XA4C0 

449168 


NM_01o20o 




A CCA QO 


AA4oozz4 




426044 


AA502490 




431854 


A A OOOCCA 

AA383OO0 


55 


405873 


NA 


440400 


AA994364 




458265 


AI075375 




413708 


BE158791 




423739 


AA398155 


60 


424408 


AI754813 


453096 


AW294631 




421825 


AA298758 




417742 


R64719 




402765 




65 


444378 


R41339 


419172 


AW338625 




401497 






402376 





Hs.224398 Homo sapiens cDNA FLJ1 1469 fis, clone HE 2.9 
Hs.152618 ESTs, Moderately similar to ZN91JHUMAN Z 2.9 
NM_005596*:Homo sapiens nuclear factor I 2.9 
Hs. 14731 3 ESTs, Weakly similar to I38022 hypotheti 2.9 
NM_002737*:Homo sapiens protein kinase C 2.9 
Hs.190149 ESTs 2.9 
Hs. 146883 ESTs 2.9 
Hs. 149006 ESTs 2.9 
Hs.20726 ESTs 2.9 
Hs.246358 ESTs, Weakly similar to T32250 hypotheti 2.9 
Target Exon 2.9 
Hs.101299 cullin 5 2.9 
gb:601315974F1 NIH_MGC_8 Homo sapiens cD2.9 
Hs.1 53954 TRAM-like protein 2.9 
Hs.55209 Homo sapiens mRNA; cDNA DKFZp434K0514 (f 2.9 
Hs.80485 adipose most abundant gene transcript 1 2.9 
gb:PM0-HT0425-141299-001-F08 HT0425 Homo2.9 
Hs.56148 NY-REN-58 antigen 2.9 
gb:ny57g01.s1 NCI_CGAP_Pr18 Homo sapiens 2.9 
Hs.1 28685 ESTs, Weakly similar to T17226 hypotheti 2.9 
Hs.270425 ESTs 2.9 
Hs.1 03982 small inducible cytokine subfamily B (Cy 2.9 
Hs.28853 CDC7 (cell division cycle 7, S.cerevisi 2.9 
Hs.1 49342 activation-induced cytidine deaminase 2.9 
Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 2.9 
Hs.238936 ESTs, Weakly similar to (defline not ava 2.9 
C12000586*:gi|6330167|dbj|BAA86477.1 1 (A 2.9 
Hs.163944 ESTs 2.9 
Hs.200949 ESTs 2.9 
Hs.195359 hypothetical protein MGC1 0954 2.9 
Hs.303662 ESTs 2.9 
gb:601 193893F1 NIH_MGC_7 Homo sapiens cD 2.9 
Target Exon 2.8 
gb:UI-H-BWO-ajc-c-07-0-Ul.s1 NCI_CGAP_Su 2.8 
Hs.130554 Homo sapiens cDNA: FLJ23089 fis, clone L 2.8 
Hs.183861 Homo sapiens CDNAFU20042 fis, clone CO 2.8 
Hs.57787 ESTs 2.8 
Hs.3782 ESTs 2.8 
Hs.58006 ESTs, Weakly similar to ALU5_HUMAN ALUS 2.8 
gb:RC3-HT0585-160300-022-c02 HT0585 Homo 2.8 
Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 2.8 
Hs.132921 ESTs 2.8 
Hs.50002 small inducible cytokine subfamily A (Cy 2.8 
gb:QV4-ST0023-1 60400-1 72-cl 2 ST0023 Homo 2.8 
Hs.132413 ESTs 2.8 
Hs.184572 cell division cycle 2, G1 to S and G2 to 2.8 
Hs.153203 MyoD family inhibitor 2.8 
gb:RC3-BT0502-251199-011-c07 BT0502 Homo 2.8 
Eos Control 2.8 
colon carcinoma related protein 2.8 
gb:aa41b12.s1 NCI_CGAP_GCB1 Homo sapiens2.8 



Hs.23142 

Hs.336695 
Hs.271699 

Hs.125594 
Hs.128193 

Hs.97600 
Hs.146428 
Hs.11325 
Hs.183747 



Hs.1 2569 
Hs.22120 



ESTs 2.8 

polymerase (DNA directed) iota 2.8 

Target Exon 2.8 

ESTs, Weakly similar to T25472 hypotheti 2.8 

ESTs, Weakly similar to IRX2_HUMAN IROQU 2.8 

gb:IL2-HT0397-091299-025-D02 HT0397 Homo 2.8 

ESTs 2.8 

collagen, type V, alpha 1 2.8 

ESTs 2.8 

ESTs, Moderately similar to CALB_HUMAN C 2.8 
gb:EST22d11 WATM1 Homo sapiens cDNA clon 2.8 

C1003621*:gi|124O74O5|gb|AAG53491.1|AF22 2.8 

ESTs 2.8 

ESTs 2.8 

Target Exon 2.8 

C19000763*:gi|1363912|pir||JC4296 ring f 2.8 



294 



Attorney Docket No.:018501-005210US 





*tUJU*t 1 


NA 


C3001706*:gi|1345652|sp|P15989|CA36_CHIC 


2.8 




HUO/ 30 


mm flfllRRR 


Hs.47504 exonuclease 1 


2.8 




431917 


D16181 


Hs.2868 peripheral myelin protein 2 


2.8 




437583 


AA761190 


Hs.244627 ESTs 


2.8 


-J 


453737 


AA744862 


Hs. 194293 ESTs, Weakly similar to I54374 gene NF2 


2.8 




458094 


AF086325 


gb:Homo sapiens full length insert cDNA 


2.8 




401283 


NA 


Target Exon 


2.8 




410784 


AW803201 


gb:IL2-UM00774)70500-080-E06 UM0077 Homo 2.8 


10 


417601 


NM_014735 


Hs.82292 KIAA021 5 gene product 


2.8 


418236 


AW994005 


Hs.337534 ESTs 


2.8 




435532 


AW291488 


Hs.1 17305 Homo sapiens, clone IMAGE:3682908, mRNA 


2.8 




454714 


AW815098 


gb:QV4-ST021 2-091 199-023410 ST0212 Homo 2.8 




418629 


BE247550 


Hs. 86859 growth factor receptor-bound protein 7 


9 ft 


15 


442101 


AI651930 


HS.IJ0004 to IS 


9 ft 


405080 


AK000375 


Hs.ooo^u nuoivio^or proiem 


9 a 
z.o 




414661 


T97401 


Hs.21929 ESTs 


9 ft 


-SSS- 


425589 AI650633 


Hs. 140DOO Homo sapiens cuina. rLJ/juo i ns, cione l 


9 ft 




429638 


AI916662 


Hs.211577 kinectin 1 (kinesin receptor) 


9 7 
l.l 




428824 W23624 


Hs. 173059 ESTs 


9 7 
Z..I 




421566 


NMJ00399 


Hs.1 395 early growth response 2 (Krox-20 (Drosop 


9 7 
l.l 




414596 


BE386870 


gb:60l275Z7iri NiH_MbO_^u tiomo sapiens c 


9 7 
l.l 




440868 


R79707 


Hs.263339 co is, Moderately similar to \stuiL nypot 


9 7 
L.I 


i :: : 


452943 


BE247449 


Hs.3 1 082 hypothetical protein FLJ 1 0525 


9 7 
l.l 


■' : 


443772 


AV646449 


Hs.282872 ESTs 


9 7 
l.l 


^5 


432361 


AI378562 


Hs.1 59585 ESTs 


9 7 
l.l 




430375 


AW371048 


Hs.93758 H4 histone family, member H 


7 
l.l 




406504 


NA 


C5000558:gi|4504675|ref|NP_002175.1 1 int 


1 7 

2.7 




423279 


AW959861 


Hs.290943 ESTs 


1 7 
1.1 


Nrv 


424871 


NM_004525 


Hs.1 53595 low density lipoprotein-related protein 


1 7 

2.7 




453619 


H87648 


Hs.33922 Homo sapiens, clone MGC:9084, mRNA, comp 


2.7 




423961 


D13666 


Hs.1 36348 osteoblast specific factor 2 (fasciclin 


2.7 


SSI 


422156 


N34524 


gb:yy56d10.s1 Soares_multiple_sclerosis_ 


2.7 


415752 


BE314524 


Hs.78776 putative transmembrane protein 


2.7 


111 


419987 


NM 005014 


Hs.94070 osteomodulin 


O 7 

2.7 


i a j 


406182 


NA 


Target Exon 


2.7 




416495 


X69970 


Hs.79350 RYK receptor-like tyrosine kinase 


2.7 




444701 


AI916512 


Hs.198394 ESTs 


O 7 

2.7 




408171 


AA301228 


Hs.43299 hypothetical protein FLJ12890 


2.7 


40 


430153 


AW968128 


Hs.336679 ESTs 


2.7 


413383 


AA1 28978 


Hs.154706 hypothetical protein FLJ14917 


2.7 




414831 


M31158 


Hs.77439 protein kinase, cAMP-dependent, regulato 


n 7 
l.l 




413278 


BE563085 


Hs.833 interferon-stimulated protein, 15 kDa 


2.7 




433132 


AB026264 


Hs.284245 hypothetical protein IMPACT 


2.7 




437030 


AA742577 


Hs.303781 EST 


2.7 


45 


439031 


AF075079 


gb:Homo sapiens full length insert cDNA 


2.7 




449532 


W74653 


Hs.271 593 ESTs, Moderately similar to A47582 B-cel 


2.7 




406153 




Target Exon 


2.7 




406625 


Y13647 


Hs.1 19597 stearoyl-CoA desaturase (delta-9-desatur 


2.7 




444698 


AM 881 39 


Hs.1 47050 ESTs 


2.7 


50 


432328 


AI572739 


Hs.1 95471 6-phosphofructo-2-kinase/fructose-2,6-bi 


2.7 




429628 


H09604 


Hs.13268 ESTs 


2.7 




420149 


AA255920 


Hs.88095 ESTs 


2.7 




431207 


AA495925 


Hs.9394 ESTs 


2.7 


55 


438394 


BE379623 


Hs.27693 peptidylprolyl isomerase (cyclophilin)-l 


2.7 


443304 


AI050073 


Hs.1 35338 ESTs 


2.7 




427660 


AI741320 


Hs.1 14121 Homo sapiens cDNA: FLJ23228 fis, clone C 


2.7 




408460 


AA054726 


Hs.285574 ESTs 


2.7 




416515 


N91716 


Hs.194140 ESTs, Weakly similar to I38022 hypotheti 


2.7 


60 


429922 


Z97630 


Hs.226117 H1 histone family, member 0 


2.7 


418203 


X54942 


Hs.83758 CDC28 protein kinase 2 


2.7 




439509 


AF086332 


Hs.58314 ESTs 


2.7 




402184 


NA 


ENSP00000245238*:CDNA FU10922 fis, clon 


2.7 




450496 


AW449251 


Hs.257131 ESTs 


2.7 


65 


451963 


AI825440 


Hs.224952 ESTs 


2.7 


457938 


AI373638 


Hs.1 33900 ESTs 


2.7 




441541 


AA938663 


Hs.199828 ESTs 


2.7 




441111 


AI806867 


Hs.126594 ESTs 


2.7 
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423020 M383092 Hs.1608 replication protein A3 (14kD) 2.7 

445354 AV653485 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, 2.7 

427961 AW293165 Hs.143134 ESTs 2.7 

410889 X91662 Hs.66744 twist (Drosophila) homolog (acrocephalos 2.7 

5 445234 AW137636 Hs.146059 ESTs 2.7 

413903 AA496493 Hs.23136 ESTs 2.7 

406069 NA Target Exon 2.7 

447410 AI470235 Hs.172698 EST 2.7 

401256 NA NM_024089*:Homo sapiens hypothetical pro 2.7 

10 415139 AW975942 Hs.48524 ESTs 2.7 

420218 AW958037 Hs.286 ribosomal protein L4 2.7 
455511 BE144762 gb:CM0-HT0180-041099-065-bO4HT0180Homo2.7 

438825 BE327427 Hs.79953 ESTs 2.6 

409564 AA045857 Hs.54943 fracture callus 1 (rat) homolog 2.6 

15 452837 AL121053 Hs.5534 Homo sapiens cDNA FLJ12961 fis, clone NT 2.6 

U 434876 AF160477 Hs.61460 Ig superfamily receptor LNIR 2.6 

421565 AK001122 Hs.105859 hypothetical protein FLJ 10260 2.6 

!ff 453279 AW893940 Hs.59698 ESTs 2.6 

L> 430785 Z30201 gb:HHEA22G Atrium cDNA library Human hea 2.6 

lf.0 456986 D38299 Hs.170917 prostaglandin E receptor 3 (subtype EP3) 2.6 

J=: 433068 NM_006456 Hs.288215 sialyltransferase 2.6 

421952 AA300900 Hs.98849 ESTs, Moderately similarto AF161511 1 H 2.6 

fli 429208 AA447990 Hs.190478 ESTs 2.6 

%. i 430733 AW975920 Hs.283361 ESTs 2.6 

.25 441720 AI346487 Hs.28739 ESTs 2.6 

W 418986 AI123555 Hs.81796 ESTs 2.6 

= 432481 AW451645 Hs.151504 Homo sapiens cDNA FU1 1973 fis, clone HE 2.6 
fp=i 434338 AW754311 gb:CM1-CT0337-141299-068-f07 CT0337 Homo 2.6 

r i! 417061 AI675944 Hs.188691 Homo sapiens CDNAFLJ12033 fis, clone HE 2.6 

BO 410530 M25809 Hs.64173 ATPase, H transporting, lysosomal (vacuo 2.6 

pi 456672 AK00201 6 Hs.1 14727 Homo sapiens, clone MGC:16327, mRNA, com 2.6 

425071 NMJ13989 Hs.1 54424 deiodinase, iodothyronine, type II 2.6 

t =: 408868 AW292286 Hs.255058 ESTs 2.6 

P 451531 AA018311 Hs.1 14762 ESTs 2.6 

|f 5 405822 Target Exon 2.6 

418301 AW976201 Hs.53913 hypothetical protein FLJ10252 2.6 

417315 AI080042 Hs.336901 ribosomal protein S24 2.6 

434699 AA643687 Hs.149425 Homo sapiens cDNA FLJ1 1980 fis, clone HE 2.6 

443204 AW205878 Hs.29643 Homo sapiens CDNAFLJ13103 fis, clone NT 2.6 

40 405638 Target Exon 2.6 
452542 AW812256 gb:RC0-ST0174-1 91 099-031 -a07ST01 74 Homo 2.6 

403943 C5000355:gi|4503225lref|NP_000765.1|cyt 2.6 

404535 Z25884 Hs.121483 chloride channel 1 .skeletal muscle (Th 2.6 

402800 NA Target Exon 2.6 

45 449144 AI989503 Hs.233405 ESTs 2.6 

454934 AW846080 Hs.314324 ESTs 2.6 

424717 H03754 Hs.152213 wingless-type MMTV integration site fami 2.6 

428303 AW974476 Hs.183601 regulatorof G-protein signalling 16 2.6 

427970 AA418187 Hs.330515 ESTs 2.6 

50 450638 AK001826 Hs.25245 hypothetical protein FLJ11269 2.6 

453034 BE246010 Hs.271468 Homo sapiens mRNA for FU00038 protein, 2.6 
455097 AW855802 gb:RC1-CT0279-170200-023-d08 CT0279 Homo 2.6 

427317 AB028955 Hs.175780 KIAA1 032 protein 2.6 

408875 NM.015434 Hs.48604 DKFZP434B168 protein 2.6 

55 427510 Z47542 Hs.179312 small nuclear RNA activating complex, po 2.6 

423201 NM_000163 Hs.125180 growth hormone receptor 2.6 

406271 Target Exon 2.6 

442696 BE566962 Hs.7063 Homo sapiens cDNA: FLJ2091 3 fis, clone A 2.6 

454018 AW016892 Hs.100855 ESTs 2.6 

60 435420 AI928513 Hs.59203 ESTs 2.6 

434398 AA121098 Hs.3838 serum-inducible kinase 2.6 
455708 BE069326 gb:QV3-BT0381-170100-060-g03 BT0381 Homo 2.6 

439347 W24320 Hs.102941 Homo sapiens cDNA: FLJ21531 fis, clone C 2.6 

407523 X64984 gb:H.sapiens mRNA HTPCRX10 for olfactory 2.6 

65 425101 AA830431 Hs.180811 ESTs 2.6 

435153 AA668763 Hs.291939 ESTs 2.6 

409139 AI681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU 2.6 
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455100 
414612 
440283 
423025 
431473 
404440 
403388 
403775 
405037 
407447 
420952 
435447 
440202 
445854 
421247 
414870 
457411 
424676 
404443 
452268 
430832 
444779 
408633 
459089 
447197 
454111 
411165 
406922 
456045 
413111 
423123 
432201 
427032 
445417 
422225 
428330 
410011 
426310 
419589 
437770 
442818 
414251 
428301 
431956 
455732 
458624 
428257 
418054 
458652 
458012 
422996 
410804 
457211 
440029 
448141 
409163 
431385 
414569 
439963 
455935 
425025 
416589 
404826 
422938 
421991 
417404 
448516 



BE160198 
BE274552 
AI732892 
AA831267 
AA825686 

NA 
NA 
NA 

AF290544 

AA282067 

AI872932 

AW516211 

AI702885 

BE391727 

N72264 

AW085961 

Y08565 

NM_003512 

AI073913 

AI192105 

AW963372 

F13036 

R36075 

AW081681 

NM_000169 

S70284 

H62943 

BE065837 

NM_012247 

AI538613 

AF012023 

AK001058 

BE245652 

L22524 

AB020641 

NMJW0909 

AW973708 

AA767881 

AK001741 

AL042306 

AW628666 

AK002032 

BE080908 

AI362790 

BE394723 

NM_002318 

AW375610 

AI424899 

BE091089 

U64820 

AW972565 

AW089705 

AI471598 

AA065081 

BE178536 

AF109298 

AW247529 

BE158687 

AW953168 

AA652687 

NMJ01809 
NM_014918 
NM_007350 
AW898595 



gb:QV1-HT0413-010200-059-h03 HT0413 Homo 2.6 

Hs.76578 protein inhibitor of activated STAT3 2.6 

Hs.1 90489 ESTs 2.6 

Hs.12244 hypothetical protein FLJ20097 2.6 

Hs.321 1 76 ESTs, Weakly similar to S65824 reverse t 2.6 

NM_021048:Homo sapiens melanoma antigen, 2.6 

C3001398*:gi|12248917|dbj|BAB20375.1| (A 2.6 

Target Exon 2.6 

NM_021 628*:Homo sapiens arachidonate lip 2.6 

gb:Homo sapiens aminopeptidase mRNA, par 2.6 

ESTs, Moderately similar to A4601 0 X-lin 2.6 

gb:wm72e03.x1 NCI_CGAP_Ut2 Homo sapiens 2.6 

ring finger protein 21 , interferon-respo 2.6 

ESTs 2.6 

general transcription factor IIH, polype 2.6 

KIAA1204 protein 2.6 

ESTs 2.6 

UDP-N-acetyl-alpha-D-galactosamine:polyp 2.6 

C8001428*:gi|6572242|emb|CAB62951 .1 1 (Z9 2.6 

H2A histone family, member L 2.6 

ESTs, Weakly similar to JE0350 Anterior 2.6 

ESTs 2.6 

PRO2000 protein 2.6 
Homo sapiens mRNA; cDNA DKFZp56401763 (f 2.6 

gb:yh88b01 .s1 Soares placenta Nb2HP Homo 2.6 

ESTs, Weakly similar to T42689 hypotheti 2.6 

galactosidase, alpha 2.6 

gb:stearoyl-CoA desaturase [human, adipo 2.6 

ESTs 2.6 
gb:RC2-BTO318-11010O-O12-g12 BT0318 Homo 2.6 

SELENOPHOSPHATE SYNTHETASE ; Human selen2.6 



Hs.88972 

Hs.1 25300 
Hs.1 45568 
Hs.102910 
Hs.300670 
Hs.1 30093 
Hs.151678 

Hs.28777 
Hs.1 00686 
Hs.1 47170 
Hs.46677 
Hs.27373 

Hs.269064 
Hs.69089 

Hs.1 541 88 

Hs.124027 
Hs.298241 
Hs.173274 
Hs.12680 
Hs.1 18281 
Hs.2256 
Hs.57856 
Hs.1 69266 
Hs.201925 
Hs.122897 
Hs.8739 
Hs.97689 
Hs.98440 
Hs.272245 

Hs.278639 
Hs.275243 
Hs.83354 
Hs.1 17102 
Hs.188211 

Hs.66521 
Hs.32399 
Hs.293711 
Hs.197531 

Hs.1 1090 

Hs.118258 

Hs.6793 

Hs.1 2407 
Hs.96151 

Hs.1594 
Hs.1 10488 
Hs.82101 



Transmembrane protease, serine 3 2.5 

integrin cytoplasmic domain-associated p 2.5 

Homo sapiens cDNA FLJ10196 fis, clone HE 2.5 

zinc finger protein 266 2.5 

matrix metalloproteinase 7 (MMP7; uterin 2.5 

PFTAIRE protein kinase 1 2.5 

neuropeptide Y receptor Y1 2.5 

Homo sapiens cDNA FLJ13446 fis, clone PL 2.5 

ESTs 2.5 

hypothetical protein FU10879 2.5 

VASA protein 2.5 

ESTs, Weakly similar to I38022 hypotheti 2.5 

Homo sapiens cDNA FU1 1 170 fis, clone PL 2.5 
gb:QV1-BT0631-280200-084-h07BT0631 Homo 2.5 

KIAA1684 protein; likely homolog of mous 2.5 

S100 calcium-binding protein A6 (calcycl 2.5 

lysyl oxidase-like 2 2.5 

hypothetical protein FLJ13046 similarto 2.5 

ESTs 2.5 
gb:PM4-BT0724-130400-006-c07 BT0724 Homo 2.5 

Machado^Joseph disease (spinocerebellar 2.5 

ESTs, Weakly similar to S51797 vasodilat 2.5 

ESTs, Weakly similar to S64329 probable 2.5 

ESTs 2.5 

gb:zm1 3a03.s1 Stratagene pancreas (93720 2.5 

membrane-spanning 4-domains, subfamily A 2.5 

prostate cancer associated protein 1 2.5 

platelet-activating factor acetylhydrola 2.5 
gb:CMO-HT0395-280100-169-b09 HT0395 Homo2.5 

ESTs 2.5 

Human DNA sequence from clone RP5-1 103G7 2.5 

Target Exon 2.5 

centromere protein A (17kD) 2.5 

KIAA0990 protein 2.5 

pleckstrin homology-like domain, family 2.5 
gb:RC1-NN0073-260400-011-g09 NN0073 Homo2.5 



297 



Attorney DocketNo.:018501-005210US 





403356 


NA 


ENSP00000251525*:Hypothetical protein Kl 


2.5 




404983 




ENSP00000252242*:Keratin, type II cytosk 


2.5 




418282 


AA215535 


Hs.98133 ESTs 


2.5 




427409 


AW467143 


Hs. 1 3541 1 actin related protein 


2.5 


5 


431806 


AF186114 


Hs.270737 tumor necrosis factor (ligand) superfami 


2.5 




443367 


AW071349 


Hs.215937 ESTs 


2.5 




421246 


AW582962 


Hs.102897 CGI-47 protein 


2.5 




439217 


AF086041 


Hs.42975 ESTs 


2.5 


10 


400925 




Target Exon 


2.5 


404552 


NA 


ENSP00000220888*:ZINC FINGER TRANSCRIPTI2. 




417168 


AL133117 


Hs.81376 Homo sapiens mRNA; cDNA DKFZp586L1121 (f 2.5 




418841 


NM 002332 


Hs.891 37 low density lipoprotein-related protein 


2.5 




426853 


U32974 


Hs. 172777 baculoviral IAP repeat-containing 4 


2.5 


15 


427738 NM 000318 


Hs.180612 peroxisomal membrane protein 3 (35kD, Ze 


2.5 


457384 AA501760 


Hs.15806 Homo sapiens mRNA; cDNA DKFZp434H2019 (f 2.5 




447128 AI271898 


Hs.164866 cyclinK 


2.5 


tzi 


454693 AW813428 


gb:MR3-ST0192-010200-210-c05 ST0192 Homo 2.5 




434657 


AA641876 


Hs.191840 ESTs 


2.5 




402077 


NA 


Target Exon 


2.5 


= 20 


400289 


X07820 


Hs.2258 matrix metalloproteinase 10 (MMP10;str 


2.5 


fti 


409723 


AW885757 


Hs.257862 ESTs 


2.5 




447020 


T27308 


Hs.1 6986 hypothetical protein FU11046 


2.5 




455068 


AI807894 


Hs.47274 Homo sapiens mRNA; cDNA DKFZp564B176 (fr 2.5 




431232 


AI024353 


Hs.131755 hypothetical protein FLJ14298 


2.5 




408938 


AA059013 


Hs.22607 ESTs 


2.5 




411571 


AA1 22393 


Hs.7081 1 hypothetical protein FLJ2051 6 


2.5 


3 


426504 


AW162919 


Hs.1 70160 RAB2, member RAS oncogene family-like 


2.5 


B 


428248 


AI126772 


Hs.40479 ESTs 


2.5 




408813 


AI580090 


Hs.48295 RNA helicase family 


2.5 


[30 


423504 


N80077 


Hs.24792 chromosome 12 open reading frame 5 


2.5 


L 


425441 


AA449644 


Hs.193063 Homo sapiens cDNA FLJ14201 fis, clone NT 


2.5 




443066 


AW297921 


Hs.255703 ESTs 


2.5 




443556 


AA256769 


Hs.94949 methylmalonyl-CoA epimerase 


2.5 




428943 


AW086180 


Hs.37636 ESTs, Weakly similar to KIAA1 392 protein 


2.5 


B 5 


425320 


U29344 


Hs.831 90 fatty acid synthase 


2.5 




430388 


AA356923 


Hs.240770 nuclear cap binding protein subunit 2, 2 


2.5 




423242 


AL039402 


Hs.125783 DEME-6 protein 


2.5 




416241 


N52639 


Hs.32683 ESTs 


2.5 


40 


440244 


AI743977 


Hs.205144 ESTs 


2.5 


409239 


AA740875 


Hs.44307 ESTs, Moderately similar to I38022 hypot 


2.5 




452464 


AW500507 


Hs.192619 KIAA1600 protein 


2.5 




410718 


AI920783 


Hs.191435 ESTs 


2.5 




408877 


AA479033 


Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 


2.5 


45 


445150 


AI446747 


Hs.338704 olfactory receptor, family 7, subfamily 


2.5 


407756 


AA1 16021 


Hs.38260 ubkptin specific protease 18 


2.5 




407633 


NM_007069 


Hs.37189 similar to rat HREV107 


2.5 




449754 


H00820 


Hs.30977 ESTs, Weakly similar to B34087 hypotheti 


2.5 




419316 AA236255 


Hs.298419 ESTs 


2.5 


50 


429118 


H20669 


Hs.35406 ESTs, Highly similar to unnamed protein 


2.5 


440331 


AL046412 


Hs.202151 ESTs 


2.5 




449344 


AI640355 


Hs.312691 ESTs 


2.5 




459006 


AW298631 


Hs.27721 Wolf-Hirschhorn syndrome candidate 1-lik 


2.5 




423165 


AI937547 


Hs.124915 hypothetical protein MGC2601 


2.5 


55 


411337 


AW837349 


gb:QV2-LT0038-270300-108-d12LT0038 Homo 2.5 


438290 


AA843719 


Hs.122341 ESTs 


2.5 




406414 




C5000506*:gi|124941 |sp|P18614|ITA1_RAT I 


2.5 




424498 


AB033043 


Hs. 149377 hypothetical protein DKFZp761L0424 


2.5 




443464 


BE548446 


Hs.5167 Homo sapiens mRNA; cDNA DKFZp434F152 (fr 2.5 


60 


424856 


AA347746 


Hs.9521 ESTs, Weakly similarto ZN43.HUMAN ZINC 


2.5 


440304 


BE1 59984 


Hs.125395 ESTs 


2.5 




409045 


AA635062 


Hs.50094 Homo sapiens mRNA; cDNA DKFZp434O051 5 (f 2.5 




422648 


D86983 


Hs.1 1 8893 Melanoma associated gene 


2.5 




428819 


AL1 35623 


Hs.193914 KIAA0575 gene product 


2.5 


65 


412520 


AA442324 


Hs.795 H2A histone family, member O 


2.5 


430602 


D13752 


Hs.184927 cytochrome P450, subfamily XIB (steroid 


2.5 




408031 


AA081395 


Hs.42173 Homo sapiens cDNA FU10366 fis, clone NT 


2.5 




403133 




Target Exon 


2.5 
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413189 BE070231 gb:QV4-BT0407-260100-087-f12 BT0407 Homo 2.5 

400346 AB041269 Hs.272263 Homo sapiens mRNA for keratin 19, partia 2.5 

435509 AI458679 Hs. 181 91 5 ESTs 2.5 

458145 AI239457 Hs.130794 ESTs 2.5 



299 



Attorney Docket No.:01 850 1 -0052 1 OUS 



TABLE 19A 



Table 19A shows the accession numbers for those pkeys lacking unigenelD's for Table 19. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

407647 1007366J AW860158 AW862385 AW860159 AW862386 AW862341 AW821869 AW821893 AW062660 AW062656 
407980 103087J M046309 AI263500 AA046397 

408254 1049346 1 AW807227 AW807576AW807137 AW807157AW807495 AW807494 AW807417 AW807083 AW845786 AW845801 
AW807130 AW807335 AW807081 AW807349 AW807339 AW807164 AW807341 AW807224 AW845903 AW177424 
AW807159 AW807123 BE141576 AW807340 AW807334 AW807520 AW807205 AW807505 BE141574 AW807390 
AW807395 AW845789 AW807101 AW807089 AW807519 AW807239 AW807509 AW807356 AW807526 AW807098 
AW807307 AW807153 AW807295 AW807313 AW807322 AW807265 AW807513 AW807516 AW807166 AW807501 
AW807120 AW807168 AW807121 AW807527 AW807151 AW807203 AW807489 AW807511 AW807158 AW845800 
AW807507 AW845793 AW845796 AW807296 AW845781 AW807444 AW845871 AW807512 AW807242 AW807141 
AW807522 AW807487 AW807514 AW807142 AW807232 AW807379 AW8071 14 AW807518 AW807199 AW80721 1 
AW807498 AW807086 AW807492 AW807218 AW807082 AW807525 AW807493 AW807523 AW807087 AW845784 
AW807037 AW807128 AW807080 AW8071 18 AW845807 AW807524 AW845803 AW807249 AW845795 AW807160 
AW807343 AW807515 AW807233 AW807289 AW177102 AW807352 AW807394 AW177105 AW807176 AW177103 
AW845870 AW1 77099 AW1 77101 AW807528 AW807336 AW807038 AW177100 AW807411 AW807088 AW845865 
AW807226 AW807517 AW807397 AW807303 AW807177 AW807154 AW807136 AW807146 AW807085 AW807521 
AW807488 AW807385 AW807355 AW807223 AW807155 

409163 1 10418J AA065081 M075017 M084791 AA071015 M081560 AA071459 AA545727 AA083100 AA085366 AA1 15845 AA075457 
M064704 AA082878 AA075742 AA0691 62 

409695 114876J AA296961 AA296889 AA076945 AA077528 AA077497 

410534 1207247J AW905138 AW753008 R13818Z43519 

410672 1214882J AW794600 AW794730 

410784 1221005.1 AW803201 BE079700 BE062940 

410785 1221055J AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 
410835 1223785J AW806906 AW806915 AW866460 AW866475 AW866462 AW866448 AW866372 AW866604 

411050 1230330.1 AW814902 BE156656 BE156667 BE156590 BE156441 BE156447 

411086 1231500J BE070800 AW875226 BE149115 

411093 1231970.1 BE067650 AW817053 

411111 1232669.1 AW818127 AW818161 R09719 

411171 1234393.1 AW820260 AW820332 R94406 

411337 1239217.1 AW837349 AW837355 AW882717 

41 1514 1248638J AW850178 AW850233 AW850445 AW850446 

411670 1253680J AW856552 AW861101 AW856574 AW861099 AW861100 AW856573 AW856576 AW856562 
411905 1265181.1 BE265067 BE264978 AW875420 
412102 1277395.1 H56435 H56572 AW892929 
412209 1283610.1 AW901456 AW901450 AW901441 
41 2248 1 285000.1 BE1 76480 AW903298 AW90331 3 

413043 1346556.1 BE158766 BE061699 BE147360 BE147362 BE061666 BE061697 BE061647 BE061678 

4131 1 1 1349546.1 BE065837 BE065805 BE065799 BE065818 BE065839 BE065831 BE065894 BE065789 BE065792 

413189 1352723.1 BE070231 BE070229 BE070255 

413221 1353887.1 BE161151 BE162495BE161002BE072205BE160989BE162482 
413499 1373910 1 BE 1 44884 H97942 

413708 1384140 J BE158791 BE158806 BE158748 BE158744 BE158740 BE158739 BE158811 BE158700 BE158741 BE158683 BE158685 
414210 1426051.1 BE383592 BE261671 
414596 1465004.1 BE386870 Z41986 H08501 
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4 140U0 


aark7QCi 1 
i400i yu„- 1 


DCO*7w t r i tU 




AAP.7A7 


4CE4QQ A 

iooioy_i 


AA0.1M9AQ AA^ft194R AA1R7Kft^ 
AAOOl ZUy AAOO I Z40 MM. I 0/ D00 




4 ID 1/0 


io/4y/o_ i 


0*97*19 Q 17110. H9A1Q9 R1Qfl7fi 
rwZ/ OZ K I id 10 nZ4 iyz r\ iyO/0 




iM77v19 
41 //4z 


■iCQCOQO A 

i oyozo*:_i 


DR/t71Q 7AARAC\ D19AS1 
r\04/ iy Z.4400U r\ IZ40 I 


J 


4i/y/4 


171 0^7 A 

1 / 1Z0/.1 


A A04H7RR TQ^7fin \4QAA(\7 

aaziu/oo iyo/uuny44u/ 




41ob00 


4 77 AM A 
Aft 4UZ_1 


^hnAo^fsf^ aaoo^qqa Awrr^nonft Awv^nonfi 
Mvv/4yooo MMzzoyyo mvv/ouzuo mvv/ouzuo 




4iyooo 


A ft £. fiftft 1 
1 OOOOO. 1 


AARfi^nc. AA9AAHQ^ AA9AA1ft^ 
MMDUOOUO MMZ44U30 MMZ*Hf 1 00 
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TABLE 19B 



Table 19B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 19. For each predicted ex on, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 
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TABLE 20: 544 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EXTRACELLULAR OR CELL- 
5 SURFACE PROTEINS 

Table 20 shows 544 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode extracellular or cell-surface proteins. These were selected as for 
Table 19, except that the ratio was greater than or equal to 3.0, the "average" normal adult 
-JO tissue level was set to the 85 th percentile value amongst 144 non-malignant tissues, and the 
U 96 th percentile value amongst the 73 breast cancer specimens was greater than or equal 1 00 
P units, and the predicted protein contained a structural domain that is indicative of 
jjri extracellular localization (e.g. ig, m3, egf, 7tm domains, signal sequences, transmembrane 
w domains). The predicted protein domains are noted. 

fib 

O Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

5 UnkjenelD: Unkjene number 

C2© Pred.ProtDomains: Predicted Protein Domains 

y Unkjene Title: Unkjene gene title 

. R1: Ratio of 93 rd percentile of tumor to 85 1 " percentile of normal body tissue 

• 25 Pkey ExAccn UnigenelD 

W 408591 AF015224 Hs.46452 

400291 AA401369 Hs.190721 
449746 AI668594 Hs.1 76588 

30 407277 AW170035 Hs.326736 

400292 AA250737 Hs.72472 
424735 U31875 Hs.272499 
426878 BE069341 
428848 NM_000230Hs.194236 

35 407178 AA195651 Hs.104106 

408000 L11690 Hs.620 

427585 D31152 Hs.179729 

429441 AJ224172 Hs.204096 

450375 AA009647 Hs.8850 

40 420931 AF044197 Hs. 100431 

422109 S73265 Hs.1473 

445730 AI624342 Hs.170042 

451110 AI955040 Hs.265398 

400297 AI127076 Hs.334473 

45 420813 X51501 Hs.99949 

452744 AI267652 Hs.30504 

424634 NM_003613Hs.151407 

420757 X78592 Hs.99915 

424399 AI905687 Hs.2533 

50 447350 A1375572 Hs. 172634 

456207 AA193450 

431448 AL137517 Hs.334473 

427217 AA399272 Hs.144341 

456938 X52509 Hs. 161 640 

5 5 435496 AW840171 Hs.265398 

402578 

453160 AI263307 Hs.239884 

422505 AL120862 Hs.124165 
444342 NM_014398Hs.10887 
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Pred.Prot. Domains 
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ig,tsp_1,SS,AAA 


cartilage intermediate layer protein, nu 


21.7 


hormone recAndrogen recep,zf-C4, 


androgen receptor (dihydrotestosterone r 


21.7 


SS 


aldehyde dehydrogenase 9 family, member 


20.3 


.pkinase, 


ESTs 


19.2 


,SS,TM,p450,p450 


gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi 


18.3 


TM 


hypothetical protein DKFZp56401278 


18.2 


SS 


ESTs 


18.2 


,SS r TM,aminotran_1_2,Cadherin_C_term,cad tyrosine aminotransferase 


18.1 


SS 


ESTs, Weakly similar to transformation-r 


17.9 


SS,p450,SS,TM,p450 


C1001134:gi|2117372|pir||l65981 fatty ac 


17.8 


SS 


H2B histone family, member L 


17.8 


SS 


programmed cell death 9 (PDCD9) 


17.7 


Lamp,SS,TM,Lamp, 


similar to lysosome-associated membrane 


17.5 
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449765 


N92293 


Hs.206832 


SS 


ESTs, Moderately similar to ALU8_HUMAN A 


17.3 




428227 


AA321649 


Hs.2248 


SSJL8, 


small inducible cytokine subfamily B (CX 


17.0 




425692 


D90041 


Hs.155956 


,SS,Acetyltransf2, 


N-acetyltransferase 1 (arylamine N-acety 


16.7 




424001 


W67883 


Hs.137476 


.pkinase, 


paternally expressed 10 


16.5 


5 


448595 


AB014544 


Hs.21572 


LRRCT,LRR,SS,LRRCT,serine_carbpept 


KIAA0644 gene product 


16.3 




449448 


D60730 


Hs.57471 


SS 


ESTs 


16.2 




418007 


M13509 


Hs.83169 


SS,hemopexin,Peptidase_M10,SS,Peptidase_matrix metalloproteinase 1 (MMP1; inters 


15.7 




418994 


AA296520 


Hs.89546 


SS lectin c,sushi,EGF,SS,EGF,lectin c,su 


selectin E (endothelial adhesion molecul 


15.5 


10 


453596 


AA441838 


Hs.62905 


SS 


hypothetical protein FLJ14834 


15.5 


452401 


NM 007115HS.29352 


SS f.l IR Xlink 


tumor necrosis factor, alpha-induced pro 


15.0 




446591 


H44186 


Hs.15456 




PD7 rinmain rjmtaininn 1 

1 lJ£m \J\Jl 1 IC1II 1 \AJl ll£lll III iy 1 


14.9 




419296 


M236115 


Hs.120785 


oo 


ESTs 


14.8 




452838 


U65011 


Hs.30743 


00,00 


nroforonfialli/ PYrtrpQCP/H antinpn in mpls 
\ji cicici iiiaiiy cA[jicaocu ainiycti in nicia 


14.7 




422805 


AA436989 


Hs.121017 


hietnna QQ hictnno htcf/^n£i 

nisione t oo,nioiuiic,[noioiic 


hictnnp familv mpmhpr A 
n^n iiioiuiic iciiiinji uicinuci n 


14.3 


15 


448390 


AL035414 


Hs.21068 


CQ 
OO 


hunrifhottr'al nmtoin 
liypuulcUUdl {JIULClll 


14.2 




447342 


AI199268 


Hs.19322 


,SS,lipocalin 


Hnmn «anipnQ Similar tn RIKFN rHNA 7010 
nuriiu odpicMo, on i iiicti isj r\ir\L.i « K^uwr^ iu iu 


14.2 




411869 


W20027 


Hs.23439 


,SS,Peptidase M1, 


ESTs 


13.9 




443709 


AI082692 


Hs.134662 


,SS,TM,SNF 


ESTs 


13.7 




459587 


AA031956 




,SS,LIM, 


gb:zk15e04.s1 Soares_pregnanLuterus_NbH 


13.7 




442580 


AI733682 


Hs.130239 


SS 


ESTs 


13.5 


i r -V" : 


400289 


X07820 


Hs.2258 


hemopexin,Peptidase_M10,SS,Peptidase_ 


MIOmatrix metalloproteinase 10 (MMP10; str 


13.5 




411598 


BE336654 


Hs.70937 


histone,SS,histone,histone 


H3 histone family, member A 


13.3 


Tt \ 


415263 


AA948033 


Hs.130853 


,SS,histone,histone,linker_histone 


ESTs 


13.2 


%. 


433805 


AA706910 


Hs.1 12742 


,SS,Ribosomal_L7Ae, 


ESTs 


13.1 


§5 


407276 


AI951118 


Hs.326736 


TM 


Homo sapiens breast cancer antigen NY-BR 


13.1 




443348 


AW873596 


Hs.182278 


,SS,DENN 


calmodulin 2 (phosphorylase kinase, delt 


13.0 


9 


421037 


AI684808 


Hs.197653 


SS 


programmed cell death 9 (PDCD9) 


12.9 




424086 


AI351010 


Hs.102267 


,SS,Lysyl oxidase 


lysyl oxidase 


12.8 




400295 


W72838 


Hs.2533 


SS 


aldehyde dehydrogenase 9 family, member 


12.7 


Ho 


452461 


N78223 


Hs.108106 


,SS,G9a,PHD, 


transcription factor 


12.5 


3 %s'- 


427365 


AI873274 


Hs.190721 


TM 


ESTs 


12.4 




433365 


AF026944 


Hs.293797 


SS TPR 


ESTs 


12.3 




409269 M576953 


Hs.22972 


SS,TM,UPF0016,SS,TM,UPF0016 


hypothetical protein FLJ13352 


12.0 




432596 


AJ224741 


Hs.278461 


SS,EGF,vwa,SS,TM,vwa, 


matrilin 3 


11.9 


r»5 


408771 


AW732573 Hs.47584 


TM,K_tetra,ion_trans,SS,TM,K_tetra,ionJ 


potassium voltage-gated channel, delayed 


11.9 




432912 


BE007371 


Hs.200313 


,SS,TM,Folate carrier 


ESTs 


11.9 




447033 


AI357412 


Hs.1 57601 


SS 


ESTs 


11.8 




421155 


H87879 


Hs.102267 


SS,Lysyl_oxidase,Aldose_epim,Epimerase,S lysyl oxidase 


11.8 


40 


424905 


NM_002497Hs.153704 


pkinase,SS,TM,pkinase,polyprenyl_synt, 


NIMA (never in mitosis gene a)-related k 


11.7 


425398 


AL049689 


Hs.1 56369 


SS 


hypothetical protein similar to tenascin 


11.6 




438167 


R28363 


Hs.24286 


,SS,TM,7tm_1,p450,rrm 


ESTs 


11.5 




459583 


AI907673 




, pkinase, 


gb:IL-BT152-080399-004 BT152 Homo sapien 


11.5 




423945 


AA410943 




death,ZU5,TM,Activin_recp,pkinase, 


gb:zt32h03.r1 Soares ovary tumor NbHOT H 


11.4 




439820 


AL360204 


Hs.283853 


SS 


Homo sapiens mRNA full length insert cDN 


11.4 


45 


402606 






SS 


NM_024626:Homo sapiens hypothetical prot 


11.3 




445263 


H57646 


Hs.42586 


,SS,Acyltransferase, 


KIAA1560 protein 


11.2 




430217 


N47863 


Hs.336901 


,SS,RNA_poLA,RNA_pol_A2,Ribosomal_S24e,ribosomal protein S24 


11.1 




447164 


AF026941 


Hs.17518 


,TM,IBR 


Homo sapiens cig5 mRNA, partial sequence 


11.1 


50 


431385 


BE1 78536 


Hs.1 1090 


,SS,TM 


membrane-spanning 4-domains, subfamily A 


11.1 


423.887 AL080207 


Hs.134585 


,SS,TM,BRCT,ank,ABC_tran,ABC_tran 


DKFZP434G232 protein 


10.9 




415385 


R17798 


Hs.7535 


,SS,Fork_head, 


COBW-like protein 


10.9 




425704 


U79293 


Hs.1 59264 


SS 


Human clone 23948 mRNA sequence 


10.7 




429859 


NM_007050Hs.225952 


,SS,TM,Y_phosphatase, MAM,fn3, 


protein tyrosine phosphatase, receptor t 


10.4 


55 


425523 


AB007948 


Hs.158244 


,SS,laminin_B,laminin_EGF,laminin_Nterm KIAA0479 protein 


10.3 


428368 


BE440042 


Hs.83326 


SS.Peptidase M1 0,hemopexin,SS,Peptidc 


se_matrix metalloproteinase 3 (stromelysin 


10.3 




418912 


NM_000685Hs.89472 


SS,TM,7tm 1,SS,TM,7tm 1, 


angiotensin receptor 1 


10.3 




422026 


U80736 


Hs.1 10826 


SS 


trinucleotide repeat containing 9 


10.3 




451952 


AL120173 


Hs.301663 


,SS,pkinase, 


ESTs 


10.3 




438199 


AW016531 Hs.122147 


,SS,ArfGap, 


ESTs 


10.2 


60 


400608 






SS,TM,SS,TM 


C10001899:gi|7508633|pir||T25392hypothe 


10.1 




413472 


BE242870 


Hs.75379 


SS 


solute carrier family 1 (glial high affi 


10.0 




432374 


W68815 


Hs.301885 


SS 


Homo sapiens cDNA FLJ11346 fis, clone PL 


9.9 




402408 


NA 




,SS,carb_anhydrase 


NM_030920*:Homo sapiens hypothetical pro 


9.8 


65 


445537 


AJ245671 


Hs.1 2844 


,SS,TM,ras 


EGF-like-domain, multiple 6 (EGFL6) 


9.7 


451621 


AI879148 


Hs.26770 


SSJipocalin.lipocalin, 


fatty acid binding protein 7, brain 


9.6 




405654 


NA 




BTB.SS 


C12001521:gi|7513934|pir||T31081 cca3 pr 


9.6 




434988 AI418055 


Hs.161160 


SS 


ESTs 


9.6 
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416220 

431808 

414142 

400298 

418601 

415539 

421451 

429432 

442441 

426429 

406687 

448693 

419948 

426214 

427718 

414812 

400285 

422330 

416182 

420077 

452281 

434531 

408380 

443788 

411078 

445495 

433426 

424871 

426215 

409045 

435525 

409203 

424902 

431725 

418092 

439840 

427811 

420807 

426320 

447078 

415786 

410102 

404347 

433687 

421373 

422634 

453310 

435957 

407771 

443646 

446142 

444649 

435147 

439138 

429220 

428804 

453511 

439809 

414869 

416276 

452862 

452926 

453331 

420802 

450603 

422867 

418004 



N49776 Hs.170994 
M30703 Hs.270833 
AW368397 Hs.1 50042 
AA032279 Hs.61635 



AA279490 

AI733881 

AA291377 

AI678059 

AI820662 

X73114 

M31126 



Hs.86368 

Hs.72472 

Hs.50831 

Hs.202676 

Hs.129598 

Ms.169849 

Hs.272620 



AW004854 Hs.228320 
AB041035 Hs.93847 



H59846 
AI798680 
X72755 
NA 

D30783 



Hs.128355 

Hs.25933 

Hs.77367 

Hs.1 15263 



NM 004354HS.79069 
AW512260 Hs.87767 



T93500 

AA642007 

AF1 23050 

AI732643 

AI222020 

BE622641 

H69125 



Hs.28792 

Hs.1 16369 

Hs.44532 

Hs.144151 

Hs.182364 

Hs.38489 

Hs.133525 



NM 004525HS.153595 
AW963419 Hs.1 55223 
AA635062 Hs.50094 
AI831297 Hs.123310 
AA780473 Hs.687 
NM_003866Hs.153687 
X65724 Hs.2839 
R45154 Hs.106604 
AW449211 Hs.105445 
M81057 Hs.1 80884 
AA280627 Hs.57846 
W47595 Hs. 169300 
AW885727 Hs.301570 
AW419196 Hs.257924 
AW248508 Hs.279727 

AA743991 

AA808229 Hs.167771 
NM_016010Hs.1 18821 



X70697 

N39015 

AL138272 

AI085198 

AI754693 



Hs.553 
Hs.190368 
Hs.62713 
Hs.1 64226 
Hs.1 45968 



AW207523 Hs.197628 
AL133731 Hs.4774 
AI742605 Hs.1 93696 
AW207206 Hs.136319 
AK000713 Hs.193736 



AL031224 
R41396 
AA1 57291 
U41060 



Hs.33102 
Hs.1 01 774 
Hs.21479 
Hs.79136 



AW378065 Hs.8687 
AI742170 Hs.31297 



AI240665 

U22376 

R43646 

L32137 

U37519 



Hs.8895 
Hs.1 334 
Hs.1 2422 
Hs.1 584 
Hs.87539 



,SS,TM 

SS,TM,EGF,SS 

,SS,UDPGT 

TM 

SS,TM,calreticulin,SS,TM,calreticulin, 

death,ZU5,TM,Activin_recp,pkinase, 

TM 

SS 

ss 

,SS,TM,fn3,ig, 



hypothetical protein MGC10946 

amphiregulin (schwannoma-derived growth 

Homo sapiens cDNA FU14438 fis, clone HE 

six transmembrane epithelial antigen of 

calmegin 

BMP-R1B 

ESTs 

synaptonemal complex protein 2 
ESTs 

myosin-binding protein C, slow-type 



SS,Peptidase_M10,hemopexin,SS,Peptidase_matrix metal loproteinase 11 (MMP11; stro 



SS 

Ferric_reduct,TM,Ferric_reduct, 
SS 

,SS,TM,histone,Sec1,histone,sugar_tr 
SS,IL8,SS,IL8 

,TMABC_tran,ABC_membrane, 

SS,TM,EGF,SS,TM 

cyclin.SS 

SS 

,SS,TGF-beta,TGFb_propeptide, 
SS 

,SS,TM,ubiquitin,7tm_3,ANF_receptor,sush 
TM 
SS.SS 



hypothetical protein FLJ23537 
NM_016931:Homo sapiens NADPH oxidase 4 ( 
ESTs, Moderately similar to ALU7_HUMAN A 
ESTs 

monokine induced by gamma interferon 

Eos Control 

epiregulin 

cyclin G2 

ESTs 

Homo sapiens cDNA FLJ11041 fis, clone PL 

ESTs 

diubiquitin 

ESTs 

CocoaCrisp 



SS,SS,ENTH,l_LWEQ,ENTH,LLWEQ,DNA_mis_reESTs, Weakly similar to I38022 hypotheti 



,SS,TM 

SS,EGF,ldl_recepLa,ldl_recepLb,SS,TM,E 

SS 

TM 

TM 

SS,p450,SS,p450 
SS.SS 

SS,Cys_knot,SS 
,death,ZU5,pkinase,Activin_recp, 
SS 



ESTs 

low density lipoprotein-related protein 
stanniocalcin 2 

Homo sapiens mRNA; cDNA DKFZp434O0515 (f 
ESTs 

cytochrome P450, subfamily IVB, polypept 
inositol polyphosphate-4-phosphatase, ty 
Nome disease (pseudoglioma) 
ESTs 

GDNF family receptor alpha 1 



SS,Zn_carbOpept,Propep_M14,SS,Propep_M14carboxypeptidase B1 (tissue) 



SS,cpn10 

SS,TGF-beta,TGFb_propeptide,SS 

,SS,kazal, 

SS 

SS 

SS 

TM 

,SS,IMPDH_C,IMPDH_N,CBS 
SS 

TM,SNF,SS,TM,SNF, 
,SS,TM 

,TM,cpn60_TCP1,Sema, 
,TSPN,vwc,tsp_1,EGF,thiored, 
,TM,cadhen"n,Cadherin_C_term, 
,SS,rrm, 

,TM.SDF,UPAR_LY6, 

TM 

SS 

,SS,UDPGT 

SS.SS 

SS 

SS 

SSJM.TM 

,SS,Pep_M1 2B_propep,Reprolysin,tsp_1 , 
,SS,TM 



ESTs 

transforming growth factor, beta 2 
ESTs 

hypothetical protein FU13782 

Homo sapiens cDNA FLJ14035 fis, clone HE 

Target Exon 

gb:ny57g01.s1 NCI_CGAP_Pr18 Homo sapiens 
ESTs 

CGI-62 protein 

solute carrier family 6 (neurotransmitte 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp761C1712 (f 

ESTs 

ESTs 

hypothetical protein FLJ20706 
transcription factor AP-2 beta (actjvati 
hypothetical protein FLJ23045 
ubinuclein 1 

LIV-1 protein, estrogen regulated 
ESTs 

duodena! cytochrome b 



,SS,TM,disintegrin,Pep_M12B_propep,Repro ESTs 
SS,NA,myb_DNA-binding v-myb avian myeloblastosis viral oncogen 

SS ESTs 

SS,EGF,tsp_3,SS,E2F_TDP, cartilage oligomeric matrix protein (COM 

SS,aldedh,SS,aldedh, aldehyde dehydrogenase 3 family, member 



9.5 
9.5 
9.4 
9.4 
9.4 
9.4 
9.3 
9.3 
9.1 
9.1 
9.1 
9.1 
9.1 
9.0 
8.9 
8.8 
8.8 



8.7 
8.7 
8.6 
8.6 
8.6 
8.5 
8.5 
8.5 
8.4 
8.4 
8.4 
8.3 
8.3 
8.3 
8.3 
8.3 
8.2 
8.2 
8.2 
8.2 
8.1 
8.1 
8.0 
8.0 
8.0 
8.0 
7.9 
7.9 
7.8 
7.8 
7.8 
7.7 
7.6 
7.6 
7.6 
7.6 
7.5 
7.5 
7.5 
7.5 
7.5 
7.4 
7.4 
7.3 
7.3 
7.2 
7.2 
7.2 
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426451 


AI908165 Hs.169946 


SS.GATA, 


GATA-binding protein 3 (T-cell receptor 


7.1 


450701 


H39960 Hs.288467 


,SS,LRR 


Homo sapiens cDNA FLJ12280 fis, clone MA 


7.1 


419519 


AI198719 Hs.176376 


SS 


ESTs 


7.1 


410555 


U92649 Hs.64311 


,TM,disintegrin,Reprolysin, 


a disintegrin and metalloproteinase doma 


7.1 


433138 


AB029496 Hs.59729 


SS,ig,Sema,SS,Sema,efhand 


semaphorin sem2 


7.0 


411558 


AA102670 Hs.70725 


SS,TM,SS,TM 


gamma-aminobutyric acid (GABA) A recepto 


7.0 


409079 


W87707 Hs.82065 


,TM,fn3, 


interieukin 6 signal transducer (gp130, 


7.0 


417275 


X63578 Hs.295449 


SS,efhand,SS,efhand,ras 


paralbumin 


7.0 


432731 


R31178 Hs.287820 


,SS,fn3,fn1,fn2,fn2,fn1 


fibronectin 1 


6.9 


442818 


AK001741 Hs.8739 


WD40.SS 


hypothetical protein FU 10879 


6.9 


407366 


AF026942 


,TM,IBR 


gb:Homo sapiens cig33 mRNA, partial sequ 


6.8 


427427 


AF077345 Hs.177936 


SS.Iectin c,SS 


ESTs 


6.8 


410785 


AW803341 


SS 


gb:IL2-UM0079-090300-050-D03 UM0079 Homo 


6.7 


401045 




ICE_p20,SS,ICE_p10,ICE_p20,ICE_p10,ICE_p 






C1 1001883*:gi|6753278]reflNP_033938.1 1 c 


6.7 


6.7 


418986 


AI123555 Hs.81796 


,SS,Repro!ysin,tsp_1, 


ESTs 


442082 


R41823 Hs.7413 


,TM,EPH lbd,pkinase,SAM,fn3, 


ESTs; calsyntenin-2 


6.7 


442861 


AA243837 Hs.57787 


SS 


ESTs 


6.6 


418836 


AI655499 Hs.161712 


,TM,Activin recp,pkinase,death,ZU5, 


ESTs 


6.6 


422060 


R20893 Hs.325823 


,SS,TM,CD36 


ESTs, Moderately similar to ALU5_HUMAN A 


6.6 


444381 


BE387335 Hs.283713 


,SS,mito_carr 


ESTs, Weakly similar to S64054 hypotheti 


6.6 


4n4fW1 


NA 


,TM,7tm 3.ANF receptor, 


Target Exon 


6.6 


417866 


AW067903 Hs.82772 


SS,Collagen,COLFI,TSPN,SS,TSPN 


collagen, type XI, alpha 1 


6.6 


428819 


AL1 35623 Hs. 1939 14 


SS.SS 


KIAA0575 gene product 


6.5 


41(177'; 


U85658 Hs.61796 


,SS,Ribosomal S4e 


transcription factor AP-2 gamma (activat 


6.4 




AW067800 Hs. 155223 


SS 


stanniocalcin 2 


6.2 


41KRR0 


NM_005025Hs.78589 


, SS.se rpin, 


serine (or cysteine) proteinase inhibito 


6.2 


1 loo ly 


AI815601 Hs.79197 


SS,TM,ig,SS,TM 


CD83 antigen (activated B lymphocytes, i 


6.2 


4 l£ I4u 


AA219691 Hs.73625 


,SS,kinesin, 


RAB6 interacting, kinesin-like (rabkines 


6.2 


449049 


AW167087 Hs.131562 


,SS,ig,Sema,pkinase, 


ESTs 


6.2 


H40 1 00 


AA026880 Hs.25252 


,SS,TM,fn3, 


prolactin receptor 


6.1 


44^1 R9 


T49951 Hs.9029 


filamentSS,filament,filament 


DKFZP434G032 protein 


6.1 




W26713 Hs.256972 


,SS,TM,DAGKa,DAGKc, 


ESTs 


6.1 


49R47Q 


Y00272 Hs. 184572 


,SS,pkinase,pkinase 


cell division cycle 2, G1 to S and G2 to 


6.1 


1UUJUU 


X03363 


,SS,TM,pkinase,Recep_L_domain,SH2,PH,FurHER2 receptor tyrosine kinase (c-erb-b2, 


6.1 


4n4fl4 


T32982 Hs. 102720 


SS 


ESTs 


6.1 


4mn7Q 

H 1 UUi 3 


U94362 Hs.58589 


Glyco_transf_8,SS 


glycogenin 2 


6.1 


4ni7fl1 

4U I / o I 




,SS,filament,Pribosyltran,filament,Armad 


Target Exon 


6.1 


447"1<;q 


NM_012093Hs. 18268 


SS.adenylatekinase, 


adenylate kinase 5 


6.1 




NA 


,SS,TM,p450, 


Target Exon 


6.1 


497R74 

Hi/ 0/4 


NM 003528HS.2178 


histone.SS.histone, 


H2B histone family, member Q 


6.1 


4£00U0 


AI249368 Hs.98558 


,SS,TM 


ESTs 


6.0 


4 JOKJ30 


BE550224 Hs.74170 


SS 


metallothionein 1E (functional) 


6.0 


41QQfift 


X04430 Hs.93913 


SS,IL6,IL6, 


interieukin 6 (interferon, beta 2) 


6.0 


41fifttfi 

4 IUUOU 


N32536 Hs.42645 


,SS,TM 


solute carrier family 16 (monocarboxylic 


6.0 


41Q7fM 


AI793257 Hs. 1281 51 


,SS,zf-C2H2, 


ESTs 


5.8 


494R87 

tLtWI 


J05070 Hs. 151 738 


SS,Peptidase_M10,fn2,hemopexin,SS,TM,Pepmatrix metalloproteinase 9 (gelatinase B 


5.8 


44QR7Q 


AI823951 Hs.129700 


SS 


tolloid-like 1 


5.8 


4919Qfi 


NM 002666HS.103253 


SS 


perilipin 


5.8 


449117 
44 C. 1 t / 


AW664964 Hs.128899 


,SS,TM 


ESTs 


5.7 


4nnw? 

4UU0U0 


AA242758 Hs.79136 


,SS,TM 


LIV-1 protein, estrogen regulated 


5.7 


41Q440 


AB020689 Hs.90419 


SS 


KIAA0882 protein 


5.7 


444858 


AI199738 Hs.208275 


SS 


ESTs, Weakly similar to ALUA HUMAN !!!! 


5.7 


432239 


X81334 Hs.2936 


SS,Peptidase_M10,hemopexin,SS,Peptidase_matrix metalloproteinase 13 (collagenase 


5.6 


440705 


AA904244 Hs.1 53205 


TM 


ESTs 


5.6 


400286 


NA 


SS,TM,ABC_tran,ABC_membrane,SS 


C16000922:gi|7499103|pir||T20903hypothe 


5.6 


446466 


H38026 Hs.308 


arrestin.SS 


arresfjn 3, retinal (X-arrestin) 


5.5 


423201 


NM 000163HS.125180 


SS,TM,fn3,SS 


growth hormone receptor 


5.5 


433043 


W57554 Hs.1 25019 


SS 


lymphoid nuclear protein (LAF-4) mRNA 


5.5 


439509 


AF086332 Hs.58314 


,SS,TM,Syntaxin 


ESTs 


5.4 


425247 


NM 005940HS.1 55324 


SS,Peptidase_M10,hemopexin,SS 


matrix metalloproteinase 1 1 (MMP1 1 ; stro 


5.4 


409757 


NM 001898HS.123114 


,SS,cystatin, 


cystatin SN 


5.4 


425292 


NM 005824HS.1 55545 


SS 


37 kDa leucine-rich repeat (LRR) protein 


5.4 


448045 


AJ297436 Hs.20166 


,SS,TM 


prostate stem cell antigen 


5.4 


452681 


AF1 53330 Hs.30246 


,SS,TM 


solute carrier family 19 (thiamine trans 


5.3 


452243 


AL355715 Hs.28555 


SS 


programmed cell death 9 


5.3 


439310 


AF086120 Hs.102793 


,SS,TM,UDPGT,casein_kappa 


ESTs 


5.2 
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441111 


AI806867 Hs.126594 


,SS,TM,Phosphodiest 


ESTs 


5.2 




452355 


N54926 Hs.29202 


TM,7tm 1,TM 


G protein-coupled receptor 34 


5.2 




427711 


M31659 Hs.180408 


SS 


solute carrier family 25 (mitochondrial 


5.2 




418636 


AW749855 


,SS,TM,HECT 


gb:QV4-BT0534-281299-053-c05 BT0534 Homo 


5.2 


5 


429353 


AL1 17406 Hs.200102 


,SS,TM,ABC tran,ABC_membrane, 


ATP-binding cassette transporter MRP8 


5.1 




441690 


R81733 Hs.33106 


,SS,HECT,zf-UBR1,PABP, 14-3-3, 


ESTs 


5.1 




430447 


W17064 Hs.332848 


SS 


SWI/SNF related, matrix associated, acti 


5.1 




429698 


AI685086 Hs.26339 


,SS,ras, 


ESTs, Weakly similar to S21348 probable 


5.1 


10 


425325 


X52730 Hs.1892 


SS,NNMT_PNMT_TEMT,SS,NNMT_PNMT_TEMT,STAR 






phenylethanolamine N-methyltransferase 


5.1 


5.1 




423600 


AI633559 Hs.310359 


SS 


ESTs 




414737 


AI160386 Hs.125087 


SS 


ESTs 


5.1 




403593 


NA 


,CIDE-N,pkinase 


Target Exon 


5.1 


15 


407758 


D50915 Hs.38365 


SS.SS 


KIAA0125 gene product 


5.0 


445234 AW137636 Hs.146059 


,SS,TM 


ESTs 


5.0 




411165 


NM 000169HS.69089 


SS,Melibiase,BTK,PH,pkinase,SH2,SH3,Ribc 


> galactosidase, alpha 


4.9 




420633 


NM 014581 Hs.274480 


SS,lipocalin,SS,lipocalin 


odorant-binding protein 2A 


4.9 




414117 


W88559 Hs.1787 


,TM,ion_trans,K_tetra, 


proteotipid protein 1 (Pelizaeus-Merzbac 


4.9 


E.I 


416783 AA206186 Hs.79889 


SS.TM.TM 


monocyte to macrophage differentiation-a 


4.9 




401093 




TM,LRRCT,TM,LRRCT, 


C12000586*:gi|6330167|dbjlBAA86477.1|(A 


4.9 




411096 


U80034 Hs.68583 


Peptidase M3, 


mitochondrial intermediate peptidase 


4.9 


•1=5 ! 


457411 


AW085961 Hs. 130093 


SS 


ESTs 


4.9 




436007 


AI247716 Hs.232168 


,SS,adh_zinc, 


ESTs 


4.9 


-is"' 


450506 


NM 004460HS.418 


SS,DPPIV_N_term,Peptidase_S9,SS,DPPIV_N_fibroblast activation protein, alpha 


4.9 




417975 


AA641836 Hs.30085 


,SS,trypsin 


hypothetical protein FLJ23186 


4.9 




421072 


AI215069 Hs.89113 


SS 


ESTs 


4.8 


fe: ; 


427032 


AF012023 Hs.173274 


,SS,14-3-3 


integrin cytoplasmic domain-associated p 


4.8 




447752 


M73700 Hs.105938 


SS,transfem'n,7tm_1 .transferrin, 


lactotransferrin 


4.8 


£ 


403199 


NA 


SS,TM,Folate_carrier,SS,TM,Folate_carrie 


NM_025243*:Homo sapiens solute carrier f 


4.8 


Cio 


427122 


AW057736 Hs.323910 


,SS,TM,pkinase,Recep_L_domain,SH2,PH,FurHER2 receptor tyrosine kinase (c-erb-b2, 


4.8 


}=:.: 


445900 


AF070526 Hs. 13429 


,SS,Ca_channel_B, 


Homo sapiens clone 24787 mRNA sequence 


4.7 




413048 


M93221 Hs.75182 


SSTMJectin c,fn2,Ricin B lectin.SS.TM 


mannose receptor, C type 1 


4.7 


1 y 


419563 


AA526235 Hs.193162 


SS 


Homo sapiens cDNA FLJ1 1983 fis, clone HE 


4.7 


J!: 


442432 


BE093589 Hs.38178 


SS 


hypothetical protein FLJ23468 


4.6 




452093 


AA447453 Hs.27860 


,SS,TM,7tm_1, 


Homo sapiens mRNA; cDNA DKFZp586M0723 (f 


4.6 




442323 AW016669 Hs.29190 


,SS,TM,CBS,voltage_CLC 


ESTs 


4.6 




450606 


A1668605 Hs.60380 


,TM,Glyco_hydro_1 


ESTs, Moderately similar to ALU6 HUMAN A 


4.6 




435542 


AA687376 Hs.269533 


,SS,pkinase,RhoGEF,ig,PH,SH3, 


ESTs 


4.6 


40 


417576 


AA339449 Hs.82285 


AIRS,formyLtransf,GARS,SS,GARS,AIRS,for phosphoribosylglycinamide formyltransfer 


4.6 


446089 


AI860021 Hs.270651 


.pkinase 


ESTs, Moderately similar to A47582 B-cel 


4.6 




445413 


AA151342 Hs.12677 


SS,UPF0099,SS,UPF0099, 


CGI-147 protein 


4.6 




424420 


BE614743 Hs.146688 


,SS,TM,MAPEG, 


prostaglandin E synthase 


4.5 




432378 


AI493046 Hs.146133 


,SS,TM,UDPGT 


ESTs 


4.5 


HO 


452190 


H26735 Hs.91668 


,SS,TM,PH,SH2,Furin-like,pkinase,Recep_L 


Homo sapiens clone PP1498 unknown mRNA 


4.5 


434674 


AA831879 Hs.136985 


,SS,HisLdeacetyl, 


ESTs 


4.5 




419986 


AI345455 Hs.78915 


pkinase.OPR, 


GA-binding protein transcription factor, 


4.5 




421582 


AI910275 Hs.1406 


SS,trefoii,SS,TM,ldl recept_a,SRCR,tryps 


trefoil factor 1 (pS2) 


4.5 




410361 


BE391804 Hs.62661 


SS,TM,GBP,TM,GBP 


guanylate binding protein 1, interferon- 


4.5 




426327 


W03242 Hs.44898 


SS 


Homo sapiens clone TCCCTA001 51 mRNA sequ 


4.5 


50 


406639 


M97711 


SS.SS.fcj, 


gb:Human T-cell receptor (V beta 18.1, J 


4.5 




452834 


AI638627 Hs.105685 


,SS,DEAD,Fork_head 


KIAA1688 protein 


4.5 




427315 


AA179949 Hs.175563 


SS 


Homo sapiens mRNA; cDNA DKFZp564N0763 (f 


4.4 




446733 


AA863360 Hs.26040 


,SS,TM,p450, 


ESTs, Weakly similar to fatty acid omega 


4.4 


55 


442118 


AA976718 Hs.202242 


,ig,Sema, 


ESTs 


4.4 


421524 


AA312082 Hs.105445 


SS 


GDNF family receptor alpha 1 


4.4 




453060 


AW294092 Hs.21594 


,SS,ras,Y _phosphatase,ras 


hypothetical protein MGC15754 


4.4 




453403 


BE466639 Hs.61779 


,SS,HMG_box,filament, 


Homo sapiens cDNA FLJ13591 fis, clone PL 


4.4 




444301 


AK000136 Hs.10760 


SS.LRR.SS 


asporin (LRR class 1) 


4.4 




453619 


H87648 Hs.33922 


SS 


Homo sapiens, clone MGC:9084, mRNA, comp 


4.3 


60 


432656 


NM 000246HS.3076 


SS.LRR, 


MHC class II transactivator 


4.3 




426384 


AI472078 Hs.303662 


,SS,ArfGap, 


ESTs 


4.3 




431701 


AW935490 Hs.14658 


,SS,BIR 


Human chromosome 5q13.1 clone 5G8 mRNA 


4.3 




416931 


D45371 Hs.80485 


SS,C1q,Collagen,SS,C1q, 


adipose most abundant gene transcript 1 


4.3 


65 


420854 


AW296927 


,SS,TM,Peptidase_M1, 


gb:lll-H-BW0-aj(x:-07-0-Ul.s1 NCI_CGAP_Su 


4.3 


418867 


D31771 Hs.89404 


SS,homeobox,homeobox, 


msh (Orosophila) homeo box homolog 2 


4.3 




443514 


BE464288 Hs.141937 


,SS,TM,MIP, 


ESTs 


4.3 




447499 


AW262580 Hs.147674 


,SS,TM,cadherin,cadherin 


protocadherin beta 16 


4.3 
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441560 

409064 

422667 

454032 

432663 

401747 

432882 

437036 

447754 

443194 

451871 

457211 

421566 

431657 

427899 

444779 

442295 

436396 

446039 

422938 

406922 

439285 

424800 

429922 

447178 

409038 

452747 

420139 

408877 

403329 

439926 

430832 

432481 

452410 

418661 

431958 

425071 

447197 

428722 

428330 

423242 

449048 

414831 

413589 

408875 

418629 

450787 

414870 

450325 

407633 

426172 

442262 

427961 

445563 

403943 

408761 

423279 

420440 

445107 

428303 

411667 

427809 

418203 

430376 

444190 

433495 

429638 



F13386 Hs.7888 
AA062954 Hs.141883 



H25642 
W31790 
AI984317 



Hs.133471 
Hs.194293 
Hs.122589 



NM_013257Hs.279696 
AI571514 Hs.133022 
AW073310 Hs.163533 
A1954968 Hs.279009 
AI821005 Hs.1 18599 
AW972565 Hs.32399 
NM_000399Hs.1395 
AI345227 Hs.105448 
AA829286 Hs.332053 
AI192105 Hs.147170 
AI827248 Hs.224398 
AI683487 Hs.152213 
AI150491 Hs.90756 
NM_001809Hs.1594 
S70284 

AL133916 Hs.172572 
AL035588 Hs. 153203 
Z97630 Hs.226117 
AW594641 Hs.192417 
T97490 Hs.50002 
BE153855 Hs.61460 
NM_005357Hs.95351 
AA479033 Hs.130315 
NA 

AW014875 Hs.137007 
AI073913 Hs. 100686 
AW451645 Hs.151504 
AL133619 Hs.29383 
NM_001949Hs.1189 
X63629 Hs.2877 
NM_013989Hs. 154424 
R36075 



U76456 
L22524 



Hs.190787 
Hs.2256 



AL039402 Hs.1 25783 



Z45051 
M31158 



Hs.22920 
Hs.77439 



AW452631 Hs.313803 
NM_015434Hs.48604 
BE247550 Hs.86859 
AB006190 Hs.25475 
N72264 Hs.300670 
AI935962 Hs.26289 
NM_007069Hs.37189 
AA371307 Hs.125056 
BE1 70651 Hs.8700 
AW293165 Hs.143134 
AW873606 Hs.149006 

M057264 Hs.238936 
AW959861 Hs.290943 
NM_002407Hs.97644 
AI208121 Hs.147313 
AW974476 Hs.183601 
BE160198 

M26380 Hs.180878 
X54942 Hs.83758 
AW292053 Hs.12532 
AI878918 Hs.10526 
AW373784 Hs.71 
AI916662 Hs.211577 



.pkinase, 

,SS,CUB, 

,SS,TM,FMO-like 

,SS,TM 

TM 

,SS,filamentfilament 
pkinase,pkinase_C, 
,SS,TM 
.pkinase, 
,SS,TM 
,SS,GDNF, 
WH1.WH1 
zf-C2H2,SS 
,SS,TM,pkinase 



,SS,SAA _prateins,ABC_membrane,ABC_tran,serum amyloid A1 
SS ESTs 
,COLFI,vwc,Collagen, 
,wnt, 

,TM,Glyco_hydro_1 
,SS,TM,thiolase, 
SS,TM,Desaturase,SS 
,SS,ig,pkinase,LRRNT,LRRCT, 
HLH.SS 

,SS,TM,linker_histone,7tm_1 
,SS,TM 
SS,IL8,SS,IL8 
,SS,HLH 
,SS,TM,p450, 
,SS,TM 
SS.SS 
SS 
SS 

,SS,Collagen,COLFl,TSPN, 
,SS,TM,ras 
SS 



Homo sapiens clone 23736 mRNA sequence 

ESTs 

ESTs 

ESTs, Weakly similar to I54374 gene NF2 
ESTs 

Homo sapiens keratin 17 (KRT17) 
serum/glucocorticoid regulated kinase-li 
ESTs 

Homo sapiens cDNA FLJ14142 fis, clone MA 

matrix Gla protein 

ESTs 

ESTs, Weakly similar to S51797 vasodilat 
early growth response 2 (Krox-20 (Drosop 
ESTs, Weakly similar to B34087 hypotheti 



Homo sapiens cDNA FLJ1 1469 fis, clone HE 
wingless-type MMTV integration site fami 
ESTs 

centromere protein A (17kD) 
gb:stearoyl-CoA desaturase [human, adipo 
hypothetical protein FLJ20093 
MyoD family inhibitor 
H1 histone family, member 0 
ESTs 

small inducible cytokine subfamily A (Cy 
kj superfamily receptor LNIR 
lipase, hormone-sensitive 
ESTs, Weakly similar to A47582 B-cell gr 
Target Exon 
ESTs 

ESTs, Weakly similar to JE0350 Anterior 
Homo sapiens cDNA FLJ1 1973 fis, clone HE 
Homo sapiens mRNA; cDNA DKFZp434E2321 (f 
E2F transcription factor 3 
SS,TM,Cadherin_C_term,cadherin,SS,TM,cadcadherin 3, type 1, P-cadherin (placenta 
SS,T4_deiodinase,T4_deiodinase, deiodinase, iodothyronine, type II 

,TM,SDF, gb:yh88b01 .s1 Scares placenta Nb2HP Homo 

,SS,TIMP, tissue inhibitor of metalloproteinase 4 

SS,Peptidase_M10,SS,Peptidase_M10,hemopematrix metalloproteinase 7 (MMP7; uterin 
SS DEME-6 protein 

SS.SS.TM similar to S68401 (cattle) glucose indue 

,SS,cNMP_binding,Rlla,HMG_box protein kinase, cAMP-dependent, regulato 

,SS,abhydrolase ESTs, Highly similar to AF1 57833 1 noncl 

SS DKFZP434B168 protein 

SS,SH2,PH,SS,TM,PH,SH2,Furin-like,pkinas growth factor receptor-bound protein 7 



SS,TM,MIP,SS,TM,MIP, 

SS 

SS 

TM.TM 
,SS,DENN 
,SS,START, 
SS 

,SS,WH1,WH1 
p450,SS,p450 
,SS,TM,7tm_1, 
SS 

,SS,SRCR,Uteroglobin 
,SS,TM 

SS,RGS,RGS,RGS 
TM 

,SS,lipase,PLAT,Sec7,PH, 

CKS.SS.CKS, 

SS 

SS 

SS,ig,MHCJ,connexin,SCAN,SS,TM 
SS.TM.SS 



aquaporin 7 
KIAA1204 protein 
ESTs 

similar to rat HREV1 07 
ESTs 

deleted in liver cancer 1 

ESTs 

ESTs 

C5000355:gi|4503225|ref|NP_000765.1 1 cyt 
ESTs, Weakly similar to (defline not ava 
ESTs 

mammaglobin 2 (MGB2; mammaglobin B; lip 
ESTs, Weakly similar to I38022 hypotheti 
regulator of G-protein signalling 16 
gb:QV1-HT0413-010200-059-h03 HT0413 Homo 
lipoprotein lipase 
COC28 protein kinase 2 
chromosome 1 open reading frame 21 
cysteine and glycine-rich protein 2 
alpha-2-glycoprotein 1, zinc 
kinectin 1 (kinesin receptor) 



4.3 

4.3 

4.3 

4.3 

4.3 

4.3 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

4.0 

3.9 

3.9 

3.9 

3.9 

3.9 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.8 

3.7 

3.7 

3.7 

3.7 

3.7 

3.7 

3.7 

3.7 

3.7 
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454071 

451859 

420281 

427691 

428824 

424676 

418026 

457465 

417601 

407999 

425548 

446619 

411213 

406625 

417511 

428769 

407137 

401866 

451195 

426044 

426310 

440029 

408573 

431830 

444781 

431493 

414175 

411789 

418851 

453968 

407104 

449051 

434398 

454042 

459496 

414998 

407756 

442101 

449722 

452554 

421991 

420058 

425776 

407846 

406925 

445873 

418054 

414921 

452268 

428862 

412520 

410530 

401780 

447131 

418334 

415138 

421168 

431473 

421379 

411984 

408101 

405366 

414612 

411393 

435767 

416406 

433068 



AI041793 Hs.42502 
H44491 Hs.252938 
AI623693 Hs.191533 
AW194426 Hs.20726 
W23624 Hs.173059 
Y08565 Hs.151678 
BE379727 Hs.83213 
AW301344 Hs.122908 
NM_014735Hs.82292 
AI1 26271 Hs.49433 
AA890023 Hs.1906 
AU076643 Hs.313 
AA676939 Hs.69285 
Y13647 Hs.1 19597 
AL049176 Hs.82223 
AW207175 Hs.106771 
T97307 

U10492 Hs.438 
AA502490 Hs.336695 
NM_000909Hs.169266 
AW089705 Hs.293711 
AA284775 Hs.43148 
Y16645 Hs.271387 
NM_014400Hs.11950 
AI791493 Hs.129873 
AI308876 Hs.103849 
AF245505 Hs.72157 
AI417828 Hs.192435 
AA847843 Hs.62711 
S57296 Hs.323910 
AW961400 Hs.333526 
AA121098 Hs.3838 
H22570 Hs.172572 
AA808940 Hs.274450 
NM_002543Hs.77729 
M1 16021 Hs.38260 
AI651930 Hs.135684 
BE280074 Hs.23960 
AW452434 Hs.58006 
NM_014918Hs.1 10488 
AK001423 Hs.94694 
U25128 Hs. 159499 
AA426202 Hs.40403 
L34041 Hs.9739 
AA250970 Hs.251946 
NM_002318Hs.83354 
BE390551 Hs.77628 
NM 00351 2HS.28777 
NM_000346Hs.2316 
AA442324 Hs.795 
M25809 Hs.64173 

NM_004585Hs.17466 
AA319233 Hs.5521 
C18356 Hs.295944 
AF182277 Hs.330780 
AA825686 Hs.321176 
Y15221 Hs.103982 
NM_005419Hs.72988 
AW968504 Hs.123073 

BE274552 Hs.76578 
AW797437 Hs.69771 
H73505 Hs.1 17874 
D86961 Hs.79299 
NM_006456Hs.288215 



,TM,7tm_1, 

,SS,TM,EGF,ldl_recept_a,ldl_recept_b,EGF 
,SS,AAA, 

,SS,Glycos_transf_2, 
SS 

.GIycos_transf_2,Ricin_BJectin,SS,Glycos 

lipocalin,SS,lipocalin,ljpocalin,ferriti 

,SS,Pribosyltran,Sulfatase 

PHD.pkinase.SS 

SS 

SS,TM,fn3,SS,TM,fn3, 
,SS,TM,efhand,ion_trans 



SS,TM,CUB,F5_F8_type_C,MAM,SS,TM,CUB,F5_neuropilin1 



ESTs 

ESTs, Weakly similar to ALU1_HUMAN ALU S 

ESTs 

ESTs 

ESTs 

UDP-N-acetyl-alpha-D-galactosamine:polyp 

fatty acid binding protein 4, adipocyte 

DNA replication factor 

KIM0215gene product 

ESTs, Weakly similar to YZ28_HUMAN HYPOT 

prolactin receptor 

secreted phosphoprotein 1 (osteopontin, 



stearoyl-CoA desaturase (delta-9-desatur 
chordin-like 
ESTs 

gb:ye53h05.s1 Scares fetal liver spleen 
Target Exon 

mesenchyme homeo box 1 
ESTs 

neuropeptide Y receptor Y1 
ESTs, Weakly similar to S64329 probable 
ESTs 

small inducible cytokine subfamily A (Cy 
GPI-anchored metastasis-associated prate 
ESTs, Weakly similar to A36036 cytochrom 
TM.hemopexin.Peptidase.MIO.hemopexin.Pehypothetical protein DKFZp761D112 
ig.LRRCT, DKFZP564I1922 protein 

,SS,TM ESTs 

,SS,HMG_box, Homo sapiens, clone IMAGE:3351 295, mRNA 

SS,TM,SH2,PH,pkinase,Recep_L_domain,Furv-erb-b2 avian erythroblastic leukemia v 



SS,TM,Desaturase,SS 
SS 

,SS,7tm_1,SPRY, 

,SS,TM,GDA1_CD39 

,SS,filament, 

SS,homeobox,Ets,SS,homeobox, 
SS 

SS,TM,7tm_1, 
SS 

,SS,TM,PMP22_Claudin, 
,SS,TM,IL8 
,SS,PH,lactamase_B 
,SS,p450,p450 



HER2 receptor tyrosine kinase (c-erb-b2, 
serum-inducible kinase 
hypothetical protein FLJ20093 
EST 

oxidised low density lipoprotein (lectin 
ubiquitin specific protease 18 
ESTs 
cyclin B1 

ESTs, Weakly similar to ALU5_HUMAN ALU S 
KIAA0990 protein 

Homo sapiens cDNA FLJ10561 fis, clone NT 
parathyroid hormone receptor 2 



SS 

pkinase,POLO_box,SS,pkinase,POLO_box, 
,SS,ig,pkinase,LRRNT,LRRCT, 
,SS,TM,KRAB,SCAN,zf-C2H2,ig 
,SS,TM 

SS,UCH-1,UCH-2,SS,TM,G_glu_transpept 
SS 

cydin,SS,TM,cyclin, 
SS 
SS 
SS 

SS,TM,7tm_2,SS,TM,7tm_2 

,TM,ABC_membraneABC_tran,Ribosomal_S4eCbp/p300-interactingtransactivator, wit 
,SS,TM,transport_prot,SWIB,RhoGAP,DAG_PEglycerol-3-phosphate dehydrogenase 1 (so 
,SS,rrm,PABP,pkinase,14-3-3,rrm poly(A)-binding protein, cytoplasmic 1-1 

,SS > TM,mito_carr,LysyLoxidase lysyl oxidase-like 2 

SS,START,SS,START,NNMT_PNMT_TEMT, steroidogenic acute regulatory protein r 
SS,histone,Calc_CGRP_IAPP,ig,MHC_l,SPRY,H2A histone family, member L 
SS.HMGJxw, 
histone,SS,histone,BolA 
ATP-synLab,SS,7tm_1 ,ATP-synt_ab 
filament,SS,filament,filament 
TM 

,SS,TM,Ribosomal_L27e, 
,Kunitz_BPTI, 
SS,p450,SS 
SS 

SS.IL8, 

SH2,STAT,SS,STAT 
,pkinase, 

RhoGEF,PH,SS,RhoGEF,PH, 
SAP.SS.FG-GAP.vwa 
SS,sushi,trypsin,vwa,mm,fibrinogen_C,fn 
,SS,Peptidase_S8,P,Peptidase_S8,P 
,SS,TM 

,SS,Pribosyltran, 
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SRY (sex determining region Y)-box 9 (ca 
H2A histone family, member O 
ATPase, H transporting, lysosomal (vacuo 
NM_005557*:Homo sapiens keratin 16 (foca 
retinoic acid receptor responder (tazaro 
ESTs 

tissue factor pathway inhibitor 2 
cytochrome P450, subfamily IIB (phenobar 
ESTs, Weakly similar to S65824 reverse t 
small inducible cytokine subfamily B (Cy 
signal transducer and activator of trans 
CDC2-related protein kinase 7 
NM_003371*:Homo sapiens vav 2 oncogene ( 
protein inhibitor of activated STAT3 
B-factor, properdin 
ESTs 

lipoma HMGIC fusion partner-like 2 
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Hs.22785 
Hs.269109 



445462 AA378776 Hs.288649 
439452 AA918317 Hs.57987 
452017 AF109302 Hs.27495 
409099 AK000725 Hs.50579 
452106 AI141031 Hs.21342 
447519 U46258 Hs.339665 
426928 AF037062 Hs.172914 
438825 BE327427 Hs.79953 
414575 H11257 Hs.22968 
417837 AL079905 Hs.1103 
422128 AW881145 
445941 AI267371 Hs.172636 
429973 AI423317 Hs.164680 
444542 AI161293 Hs.280380 
459561 AI547306 Hs.134981 
425741 AF052152 Hs.159412 
426501 AW043782 Hs.293616 
456508 AA502764 Hs.1 23469 
434228 Z42047 Hs.283978 
415752 BE314524 Hs.78776 
400419 AF084545 
439750 AL359053 Hs.57664 
423858 AL1 37326 Hs. 133483 
428514 AW236861 Hs.193139 
428698 AA852773 Hs.334838 
448988 Y09763 
432072 N62937 
417433 BE270266 Hs.82128 
452194 AI694413 Hs.332649 
444051 N48373 Hs. 10247 
420042 AW015140 Hs.161723 
457292 AI921270 Hs.334882 
421458 NM_003654Hs. 104576 
431104 AW970859 Hs.313503 
443767 BE562136 Hs.9736 
419589 AW973708 Hs.201925 
415447 Z97171 Hs.78454 
443464 BE548446 Hs.5167 
423431 AA326062 
413278 BE563085 Hs.833 
458451 AW297181 Hs.195922 
440449 AA885430 Hs.201925 
413753 U17760 Hs.75517 
434876 AF1 60477 Hs.61460 
435575 AF213457 Hs.44234 
415773 R21651 Hs.324725 
446440 AV658411 Hs.42656 
450847 NM_003155Hs.25590 
426075 AW513691 Hs.270149 
452110 T47667 Hs.28005 
439963 AW247529 Hs.6793 
402837 NA 

439451 AF086270 Hs.278554 
406664 L34041 Hs.9739 
417315 AI080042 Hs.336901 
413011 AW068115 Hs.821 
414987 AA524394 Hs.294022 
429197 H24471 Hs.26930 
448030 N30714 Hs.325960 
407604 AW191962 Hs.249239 
419092 J05581 Hs.89603 
456672 AK002016 Hs.1 14727 
443171 BE281128 Hs.9030 
452256 AK000933 Hs.28661 
432201 AI538613 Hs.298241 
406642 AJ245210 
400903 NA 



SS.SS 

SS.SS 

SS 

SS 

SS 

SS 

,SS,adh_short,TGF-beta,TGFb_propeptide 

,SS,TM,histone,ANF_receptor,guanylate_cy 

,SS,pkinase,ig, 

SS,TGFb_propeptide,TGF-beta,SS 
SS 

SS,SS,lipoxygenase,PLAT 
,SS,T-box,UDPGT 

SS,SS,Peptidase_M1,EGF,ig,lectin_c,sushi 
SS 

.pkinase, 

SS 

SS 

,SS,TM,7tm_1 
TM 

,SS,Peptidase_M1, 
,TM,integrin_B,Ricin_BJecan,imi 

,ss,tm 

,ss,start,nnmt_pnmt_temt, 

SS 

SSJMJM 
,Sema,ig, 

SS,TM,LRRCT,LRRNT,LRR,TM,LRRCT, 

,SS,TM,7tm_3,ANF_receptor,sushi 

,SS,ig, 

,SS,CUB, 

SS,TM,SS,TM,G-patch 
SS 

,Sema,ig, 

,SS,PCI,RasGEF,hormone_rec,zf-C4, 
,FGF, 

SS,OLF,OLF,OLF,Ribosomal_L4 

SS,TM,SSF,SS,TM 

,SS,p450,p450 

,SS,TM,ubiqurSn,laminin_G,laminin_EGF,k 

,SS,Ribosomal_L14 

,FGF, 

SS,laminin_EGF,laminin_Nterm,adh_short,S 

,SS,HLH 

SS,ig,SS,TM 

,SS,TM,Ribosomal_S3Ae,G-gamma 
SS 

,SS,homeobox, 
,SS,fn3, 

,SS,TM,Activin_recp,pkinase 

,TM,p450,Ets 

SS 

,SS,Chromo_shadow,chromo, 



hypothetical protein MGC3077 

B-cell CLL/lymphoma 1 1 B (zinc finger pro 

prostate cancer associated protein 7 

hypothetical protein FLJ20718 

ESTs 

ESTs 

retinol dehydrogenase 5 (1 1-cis and 9-ci 
ESTs 

Homo sapiens clone IMAGE:451939, mRNA se 
transforming growth factor, beta 1 
gb:QV0-OT0033-01 0400-1 82-a07 OT0033 Homo 
ESTs 
ESTs 

aminopeptidase 
ESTs 

Homo sapiens clone 24628 mRNA sequence 
ESTs 

ESTs, Weakly similar to AF208855 1 BM-01 
Homo sapiens PR02751 mRNA, complete cds 
putative transmembrane protein 
Target 

Homo sapiens mRNA full length insert cDN 
Homo sapiens mRNA; cDNA DKFZp434B0650 (f 
ESTs 

KIM1866 protein 

gamma-aminobutyric acid (GABA) A recepto 
ESTs 

5T4 oncofetal trophoblast glycoprotein 
olfactory receptor, family 2, subfamily 
activated leucocyte cell adhesion molecu 
ESTs 

hypothetical protein FLJ14251 
carbohydrate (keratan sulfate Gal-6) sul 
ESTs 

proteasome (prosome, macropain) 26S subu 
Homo sapiens cDNA FLJ13446 fts, clone PL 
myocilin, trabecular meshwork inducible 
Homo sapiens mRNA; cDNA DKFZp434F152 (fr 
gb:EST29171 Cerebellum II Homo sapiens c 
interferon-stimulated protein, 15 kDa 
ESTs 

Homo sapiens cDNA FLJ 13446 fis, clone PL 
laminin, beta 3 (nicein (125kD), kalinin 
Ig superfamily receptor LNIR 
triggering receptor expressed on myeloid 
ESTs, Moderately similar to A47582 B-cel 
KIAA1681 protein 
stanniocalcin 1 

ESTs, Weakly similar to 2109260A B cell 
Homo sapiens cDNA FLJ 1 1309 fis, clone PL 
platelet-activating factor acetylhydrola 
ENSP00000241312*:DJ947L8.1.8 (novel Sush 
heterochromatin-like protein 1 



,oo,\/liiUMiu_atiaui/»v,v,iiiuHiv, 

,SS,TM,transporLprot,SWIB,RhoGAP,DAG_PEglycerol-3-phosphate dehydrogenase 1 (so 
,SS,RNA_poLA,RNA_poLA2,Ribosomal_S24e,ribosomal protein S24 



SS.LRR.LRRNT.SS.LRRNT.LRR, 
,SS,connexin,hormone_rec,zf-C4,connexin 
.SS.Gelsolin, 
,SS,TM 
,SS,TM,C1q, 
SS.TM.SEA, 
,SS,PK,PK 
SS,TM,7tm_1,rrm,SS 
,TM,GD!,7tm_1, 

SS,TM,trypsin,SS,TM,trefoil,trypsin,tref 

SS 

SS 
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biglycan 
hypothetical protein FLJ 14950 
ESTs, Weakly similar to T20272 hypotheti 
membrane-spanning 4-domains, subfamily A 
collagen, type VIII, alpha 2 
mucin 1, transmembrane 
Homo sapiens, clone MGC:16327, mRNA, com 
TONDU 

Homo sapiens cDNA FU10071 fis, clone HE 
Transmembrane protease, serine 3 
gb:Homo sapiens mRNA for immunoglobulin 
Target Exon 
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434408 
452994 
445903 
424364 
410196 
419150 
433417 
418624 
436291 
411000 
412519 
450223 
422790 
424269 
435854 
447388 
451631 
448105 
438637 
423024 
456592 
425920 
444670 
401197 
437755 
452560 
410274 
450098 
404826 
458389 
408196 
433675 
418848 
422095 
415992 
424631 
409956 
419667 
450946 
447770 
423826 
418838 
442804 
432284 



A1031771 

AW962597 

AI347487 

AW383226 

AI936442 

T29618 

AA587773 

AI734080 

BE568452 

N40449 

AA196241 

AA418204 

AA809875 

AW137691 

AJ278120 

AW630534 

R00866 

AW591433 

BE500941 

AA593731 

R91600 

AL049977 

H58373 



Hs.132586 

Hs.31305 

Hs.132781 

Hs.201189 

Hs.59838 

Hs.89640 

Hs.8859 

Hs.104211 

Hs.5101 

Hs.201619 

Hs.73980 

Hs.241493 

Hs.25933 

Hs.199754 

Hs.4996 

Hs.76277 

Hs.298241 
Hs.1 26730 
Hs.325823 

Hs.162209 
Hs.332938 



AW204256 Hs.291887 
BE077084 Hs.336432 
AA381807 Hs.61762 
W27249 Hs.8109 



H70284 

AL034548 

AW977653 

AI820961 

AI868872 

C05837 

AA688021 

AW103364 

AU077005 

AA374569 

AB032417 

U20325 

AW385224 

AW300118 

AA532807 



Hs.160152 

Hs.43627 

Hs.75319 

Hs.193465 

Hs.282804 

Hs.145807 

Hs.179808 

Hs.727 

Hs.92208 

Hs.1 27698 

Hs.19545 

Hs.1 707 

Hs.35198 

Hs.131257 

Hs.105822 



,SS,Glyco_hydro_2 ESTs 
SS,WD40,SS,WD40, KIAA1547 protein 

fn3,SS,TM,EF1BD class I cytokine receptor 

SS ESTs, Weakly similar to G01763 atrophin- 

UBACT_repeat,SS,UBACT_repeatThiF_familyhypothetical protein FLJ10808 
,TM,pkinase,fn3, TEK tyrosine kinase, endothelial (venous 

,SS,SRCR, Homo sapiens, Similar to RIKEIM cDNA 5830 

.Sema.ig, ESTs 
,SS,abhydrolase, protein regulator of cytokinesis 1 

SS ESTs, Weakly similar to S38383 SEB4B pro 

SS,Troponin,Hemagglutjnin,SS,TM,C2,Tropo troponin T1, skeletal, slow 



,SS,pro_isomerase, 
,TM,histone,Sec1,histone,sugar_tr 
,SS,TM,7tm_2,GPS 
,SS,WD40 

,SS,TM,rrm,oxidored_q6,oxidored_q6 
SS 

,SS,TM,trefoil,trypsin,trefoil 
,TM,PH, 
,SS,TM,CD36 



natural killer-tumor recognition sequenc 
ESTs 
ESTs 

putative ankyrin-repeat containing prate 
Homo sapiens, clone MGC:9381, mRNA, comp 
gb:ye79c02.s1 Soares fetal liver spleen 
Transmembrane protease, serine 3 
ESTs, Weakly similar to KIAA1214 protein 
ESTs, Moderately similar to ALU5JHUMAN A 



,SS,Ran_BP1,LIM,Ran_BP1 ,GRIP,TPR,LIM gb:yq10c02.r1 Soares fetal liver spleen 
SS,TM,SS,TM,PMP22_Claudin,PMP22_Claudinclaudin8 
SS TM hypothetical protein MGC5370 

arf.arf, ENSP00000229263*:HSPC213. 
,wnt, ESTs 
,SS,rrm,zf-RanBP,pkinase,C2,pkinase_C,DA ESTs 
SS.SS hypoxia-inducible protein 2 

SS hypothetical protein FLJ21080 

,SS,TM Target Exon 

,SS,RA ESTs, Weakly similar to FPHU alpha-fetop 

HMGJ)ox,pkinase,zf-CCHC,SS,TM,HMG_box,SRY (sex determining region Y)-box 22 
,SS,ribonuc_red_sm, ribonucleotide reductase M2 polypeptide 

,death,ZU5,pkinase,Activin_recp, ESTs 
SS,Cu-oxidase,SS,Cu-oxidase,Cu-oxidase hypothetical protein FLJ22704 
,SS,TM hypothetical protein FU13593 

SS ESTs 

SS,TGF-beta,TGFb_propeptide,SS,TGF-beta,inhibin, beta A (activin A, activin AB a 
SS,disintegrin,Reprolysin,Pep_M12B_prope a disintegrin and metalloproteinase doma 
SS ' ESTs, Moderately similar to 21 09260A Be 

Friz2led,Fz,SS,TM,Frizzled,Fz, frizzled (Drosophila) homolog 4 

SS.SS cocaine- and amphetamine-regulated trans 

,SS,TM,Phosphodiest, ectonucleotide pyrophosphatase/phosphodi 

,SS,TM,G-gamma ESTs 
,SS,TM,pkinase, ESTs 
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TABLE 20A 

Table 20A shows the accession numbers for those pkeys lacking unigenelD's for Table 20. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 


CAT number 


Accessions 


410785 


1221055 1 


AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 


411667 


1253334J 


BE160198 AW935898 T1 1520 AW935930 AW856073 AW861034 


418636 


177402 1 


AW749855 AA225995 AW750208 AW750206 


420854 


197072 1 


AW296927 AI684514 AI263168 AA281079 


422128 


211994J 


AW881145 AA490718 M85637 AA304575 T06067 AA331991 


423431 


228162 1 


M326062 M325758 AW962182 


423945 


233566J 


AA410943 AW948953 AA334202 AA332882 


426878 


273265 1 


BE069341 AW748403 AL044891 AI908240 M393080 


433687 


373061 1 


AA743991 AA604852 AW272737 


447197 


711623J 


R36075AI366546 R36167 


451631 


878098 1 


R00866R01523AI806815 


456207 


165078_-1 


AA1 93450 


456592 


202684 1 


R91600T87079AA291455 
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TABLE 20B 



Table 20B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 20. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref 



Strand NLposition 



400608 9887666 Minus 96756-97558 

400903 2911732 Plus 59112-59228 

401045 8117619 Plus 90044-90184,91111-91345 

401093 8516137 Minus 22335-23166 

401197 9719705 Plus 176341-176452 

401747 9789672 Minus 118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,131097-131258,131866- 
131932 132451-132575 133580-134011 

401780 7249190 Minus 28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573 

401781 7249190 Minus 83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814 
401866 8018106 Plus 73126-73623 

402230 9966312 Minus 29782-29932 

402408 9796239 Minus 110326-110491 

402578 9884928 Plus 66350-66496 

402606 9909429 Minus 81747-82094 

402837 9369121 Minus 2013-2186,9570-9758,11136-11309,19429-19677,21210-21455,23368-23562,24342-24527,29132-29320 

403199 9958183 Minus 58895-59036,66618-66789 

403329 8516120 Plus 96450-96598 

403593 6862650 Minus 62554-62712,69449-69602 

403943 7711864 Plus 100742-100904,101322-101503 

404091 7684554 Minus 82121-83229 

404347 9838195 Plus 74493-74829 

404826 6572184 Plus 47726-48046 

405366 2182280 Plus 22478-22632 

405654 4895155 Minus 53624-53759 
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TABLE 21: 210 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
5 LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS 

AMENABLE TO MODULATION BY SMALL MOLECULES 

Table 21 shows 210 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode either enzymes or proteins amenable to modulation by small 
10 molecules. These were selected as for Table 19, except that the ratio was greater than or equal 

to 3.0, the "average" normal adult tissue level was set to the 85 th percentile value amongst 
jss. 144 non-malignant tissues, and the 96 th percentile value amongst 73 breast cancers was 
P greater than or equal 80 units, and the predicted protein contained a structural domain that is 
=5 indicative of enzymatic function or of being modulatable by small molecules (e.g. pkinase, 
Jfc peptidase, phosphatase, ATPase, or ionjransporter domains). The predicted protein domains 
2\ are noted. 



.20 



25 



30 



35 



40 



45 



50 



55 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unkjene number 

Unigene Title: Unkjene gene title 

R1 : Ratio of 93rd percentile tumor to 85* percentile of normal body tissue 



Pkey 


ExAccn 


UnigenelD 


449746 


AI668594 


Hs.176588 


400292 


AA250737 


Hs.72472 


424735 


U31875 


Hs.272499 


407178 


AA1 95651 


Hs.104106 


408045 


AW138959 Hs.245123 


450375 


AA009647 


Hs.8850 


429170 


NM_001394Hs.2359 


445730 


AI624342 


Hs.170042 


424634 


NM_003613Hs.151407 


420757 


X78592 


Hs.99915 


424399 


AI905687 


Hs.2533 


447350 


AI375572 


Hs.172634 


456207 


AA1 93450 




456938 


X52509 


Hs.161640 


402578 






425692 


D90041 


Hs.1 55956 


424001 


W67883 


Hs.1 37476 


418007 


M13509 


Hs.83169 


421727 


Y13153 


Hs.1 07318 


411869 


W20027 


Hs.23439 


400289 


X07820 


Hs.2258 


443348 


AW873596 


Hs.182278 


424086 


AI351010 


Hs.102267 


400295 


W72838 


Hs.2533 


408771 


AW732573 Hs.47584 


421155 


H87879 


Hs.102267 


424905 


NM_002497Hs.153704 


438167 


R28363 


Hs.24286 


459583 


A1907673 




423945 


AA410943 




445263 


H57646 


Hs.42586 



Predicted Protein Domains 

SS,p450 

death,ZU5,TM,Activin_recp,pkinase, 
SS.TM 

SS.Dihydroorotase, 
PhosphodiestSomatomedin_B, 



UnigeneTitle 

ESTs, Weakly similar to CP4Y_HUMAN CYTOC 
BMP-R1B 

short-chain alcohol dehydrogenase family 

ESTs 

ESTs 



SS,TM,disintegrin,Pep_M12B_propep,Reprol a disintegrin and metalloproteinase doma 



DSPc,Rhodanese, 

SS,TM,Cation_efflux 

ig,tsp_1,SS,AAA 

hormone rec,Androgen_recep,zf-C4, 
SS 

pkinase, 

SS,TM,p450,p450 



dual specificity phosphatase 4 
ESTs 

cartilage intermediate layer protein, nu 
androgen receptor (dihydrotestosterone r 
aldehyde dehydrogenase 9 family, member 
ESTs 

gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi 



SS,TM,aminotran_1_2,Cadherin_CJerm,cadhtyrosine aminotransferase 
SS,p450,SS,TM,p450 C1001134:gi|2117372|pir||l65981 fatty ac 

SS,Acetyltransf2, N-acetyltransferase 1 (arylamine N-acety 

pkinase, paternally expressed 10 

SS,hemopexin,Peptidase_M10,SS,Peptidase_matrix metalloproteinase 1 (MMP1; inters 

kynurenine 3-monooxygenase (kynurenine 3 
SS,Peptidase_M1, ESTs 

hemopexin,Peptidase_M10,SS,Peptidase_M10matrix metalloproteinase 10 (MMP10; str 
SS.DENN calmodulin 2 (phosphorylase kinase, delt 

SS.LysyLoxidase lysyl oxidase 

SS aldehyde dehydrogenase 9 family, member 

TM,K_tetra,ion_trans,SS,TM,K_tetra,ion_t potassium voltage-gated channel, delayed 
SS,LysyLoxidase,Aldose_epim,Epimerase,S lysyl oxidase 
pkinase,SS,TM,pkinase,polyprenyl_synt, NIMA (never in mitosis gene a)-related k 



SS,TM,7tm_1,p450,rrm 
pkinase, 

death,ZU5,TM,Activin_recp,pkinase, 
SS,Acyltransferase, 



ESTs 

gb:IL-BT1 52-080399-004 BT152 Homosapien 
gb:zt32h03.r1 Soares ovary tumor NbHOT H 
KIAA1560 protein 



R1 

65.7 
55.9 
53.8 
39.3 
34.9 
25.7 
24.9 
24.1 
21.7 
21.7 
20.3 
19.2 
18.3 
18.1 
17.8 
16.7 
16.5 
15.7 
15.3 
13.9 
13.5 
13.0 
12.8 
12.7 
11.9 
11.8 
11.7 
11.5 
11.5 
11.4 
11.2 
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423887 AL080207 Hs.134585 
429859 NM_007050Hs.225952 
428368 BE440042 Hs.83326 
418912 NM_000685Hs.89472 
451952 AL120173 Hs.301663 
402408 NA 

415539 AI733881 Hs.72472 
406687 M31126 Hs.272620 
419948 AB041035 Hs.93847 
400285 NA 

408380 AF1 23050 Hs.44532 
409203 AA780473 Hs.687 
424902 NM_003866Hs.153687 
431725 X65724 Hs.2839 
418092 R45154 Hs.106604 
M81057 Hs.180884 
M90516 Hs.1674 
AW242243 Hs.168670 
NM_001034Hs.75319 
NM_004482Hs.278611 
D38299 Hs.170917 
AI240665 Hs.8895 
Y12735 Hs.38018 
U37519 Hs.87539 
U92649 Hs.64311 



SS,TM,BRCT,ank,ABC_tran,ABC_tran DKFZP434G232 protein 
SS,TM,Y_phosphatase,MAM,fn3, protein tyrosine phosphatase, receptor t 

SS,Peptidase_M10,hemopexin,SS > Peptidase_matrix metalloproteinase 3 (stromelysin 
SS,TM,7tm_1 ,SS,TM,7tm_1 , angiotensin receptor 1 

SS.pkinase, ESTs 

SS,carb_anhydrase NM_030920*:Homo sapiens hypothetical pro 

death,ZU5,TM,Activin_recp,pkinase, BMP-R1B 

SS,Peptidase_M10,hemopexin,SS I Peptidase_matrix metalloproteinase 11 (MMP11; stro 
Fem'c_reductTM,Ferric_reduct, NM_01 6931 :Homo sapiens NADPH oxidase 4 ( 

TM,ABC_tran,ABC_membrane, Eos Control 

SS,TM,ubiquitin,7tm_3,ANF_receptor,sushi diubiquitin 

SS,p450,SS,p450 cytochrome P450, subfamily IVB, polypept 



SS.SS 

SS,Cys_knot,SS 
death,ZU5,pkinase,Activin_recp, 



inositol polyphosphate-4-phosphatase, ty 
Norrie disease (pseudoglioma) 
ESTs 



427811 
423554 
426261 
413374 
432677 
456986 
453331 
407721 
418004 
410555 
443695 
423545 
401045 



AW204099 Hs.337720 
AP000692 Hs.129781 



SS,Zn_carbOpept,Propep_M14,SS,Propep_M14carboxypeptidase B1 (Issue) 
GATase_2,SIS, glutamine-fructose-6-phosphate transamin 

SS,TM,WD40,ubiquitin,E1 -E2_ATPase,Cation peroxisomal famesylated protein 
SS ribonucleotide reductase M2 polypeptide 

SS,TM,Glycos_transf_2,Ricin_BJectin,DPP UDP-N-acetyl-alpha-D-galactosamine:polyp 
SS,TM,7tm_1 , prostaglandin E receptor 3 (subtype EP3) 

SS,TM,disintegrin,Pep_M12B_propep,Reprol ESTs 

pkinase, dual-specificity tyrosine-(Y)-phosphoryl 

SS,aldedh,SS,aldedh, aldehyde dehydrogenase 3 family, member 

TM.disintegrin.Reprolysin, a disintegrin and metalloproteinase doma 

ESTs, Weakly similar to AF1 26780 1 retin 
GAF.PDEase chromosome 21 open reading frame 5 

ICE_p20,SS,ICE_p10,ICE_p20,ICE_p10,ICE_p 



442082 
418836 
404091 



Hs.161712 



C1 1001 883*:gi|6753278|ref |NP_033938.1 1 c 
R41823 Hs.7413 TM,EPH_lbd,pkinase,SAM,fn3, 

TM,Activin_recp,pkinase,death,ZU5, 
TM,7tm_3,ANF_receptor, 
zf-C2H2 

SS,AAA,Viral_helicase1 ,rrm, 
TM,ionJrans,SS,TM,ion_trans, 
TM,7tm_1, 
SS.se rpin, 

SS,ig,Sema,pkinase, 
SS.TM 

SS.pkinase.pkinase 



AI655499 
NA 

450865 AI248013 Hs.106532 
424085 NM_002914Hs.139226 
449650 AF055575 Hs.23838 
432304 AA932186 Hs.69297 
415669 NM_005025Hs.78589 
442942 AW167087 Hs.131562 
428795 R45503 Hs.97469 
428479 Y00272 Hs. 184572 
400300 X03363 
447359 NM_012093Hs.18268 
402230 NA 

424687 J05070 Hs. 151 738 
432328 AI572739 Hs. 195471 
432239 X81334 Hs.2936 
400286 NA 

425247 NM_005940Hs.155324 
434737 AA828246 Hs.291884 
439310 AF086120 Hs.102793 
441111 AI806867 
N54926 
M31659 
AL1 17406 
X52730 



6.7 

ESTs; calsyntenin-2 
ESTs 

Target Exon 

ESTs, Weakly similar to I38588 reverse t 
replication factor C (activator 1) 2 (40 
calcium channel, voltage-dependent, Lty 
ESTs 

serine (or cysteine) proteinase inhibito 
ESTs 

ESTs, Highly similar to A39769 N-acetyll 
cell division cycle 2, G1 to S and G2 to 



452355 
427711 
429353 
425325 



Hs.1 26594 
Hs.29202 
Hs.1 80408 
Hs.200102 
Hs.1 892 

phenylethanolamine N-methyltransferase 
448706 AW291095 Hs.21814 SS.TM.pkinase 
403593 NA 

432777 AA564991 Hs.269477 
446232 AI281848 Hs.194691 
411165 NM_000169Hs.69089 
414117 W88559 Hs.1787 
411096 U80034 Hs.68583 
450506 NM 004460HS.418 
417975 AA641836 Hs.30085 
447752 M73700 Hs.105938 
427122 AW057736 Hs.323910 



SS,TM,pkinase,Recep_L_domain,SH2,PH,FuriHER2 receptor tyrosine kinase (c-erb-b2, 
SS,adenylatekinase, adenylate kinase 5 

SS,TM,p450, Target Exon 

SS,Pepfidase_M10,fn2,hemopexin,SS,TM,Pepmatrix metalloproteinase 9 (gelatinase B 
6PF2K.PGAM, 6-phosphofructo-2-kinase/fructose-2,6-bi 
SS,Peptidase_M10,hemopexin,SS,Peptidase_matrix metalloproteinase 13 (collagenase 
SS,TM,ABC_tran,ABC_membrane,SS C16000922:gi|7499103|pir)|T20903 hypothe 
SS,Peptidase_M1 0,hemopexin,SS matrix metalloproteinase 1 1 (MMP1 1 ; stro 

UCH-1,pkinase,OPR,Rhodanese,AMP-binding,ESTs 
SS,TM,UDPGT,casein_kappa ESTs 
SS,TM,Phosphodiest, ESTs 
TM,7tm_1 ,TM G protein-coupled receptor 34 

SS solute carrier family 25 (mitochondrial 

SS,TM,ABC_tran,ABC membrane, ATP-binding cassette transporter MRP8 

SS,NNIvn'_PNm_TEIvrr,SS,NNMT_PNMT_TErv1T,STAR 

5.1 

interleukin 20 receptor, alpha 
CIDE-N,pkinase Target Exon 

alpha-amylase, ESTs 
SS,TM,7tm_3,Ribosomal_L13 retjnoic acid induced 3 

SS,Melibiase,BTK,PH,pkinase,SH2,SH3,Ribo galactosidase, alpha 
TM,ion_trans,K_tetra, proteolipid protein 1 (Pelizaeus-Merzbac 

Peptidase_M3, mitochondrial intermediate peptidase 

SS,DPPlV_N_term,Peptidase_S9,SS,DPPIV_N_fibroblast activation protein, alpha 
SS.trypsin hypothetical protein FLJ23186 

SS,transferrin,7tm_1 .transferrin, lactotransferrin 
SS,TM,pkinase,Recep_L_domain,SH2,PH,FuriHER2 receptor tyrosine kinase (c-erb-b2, 
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10.9 

10.4 

10.3 

10.3 

10.3 

9.8 

9.4 

9.1 

9.1 

8.8 

8.6 

8.3 

8.3 

8.3 

8.3 

8.2 

8.1 

7.8 

7.6 

7.6 

7.5 

7.3 

7.2 

7.2 

7.1 

6.9 

6.8 



6.7 
6.6 
6.6 
6.5 
6.5 
6.4 
6.2 
6.2 
6.2 
6.1 
6.1 
6.1 
6.1 
6.1 
5.8 
5.8 
5.6 
5.6 
5.4 
5.4 
5.2 
5.2 
5.2 
5.2 
5.1 



5.1 
5.1 
5.0 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.8 
4.8 
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400181 
452093 
435542 
417576 
446089 
424420 
452190 
419986 
421582 
446733 
453060 
400205 
420854 
432690 
441560 
416445 
439024 
432882 
447754 
453775 
431657 
427899 
422938 
418478 
406922 
439285 
429922 
420139 
425071 
424511 
428722 
428330 
414831 
413589 
418629 
413453 
403943 
444618 
408761 
427809 
418203 
454071 
424676 
457465 
417601 
446619 
406625 
428769 
426310 
417531 
444781 
431493 
428966 
414175 
455325 
429597 
425320 
431854 
407104 
449051 
434398 
454042 
407756 
401464 
412970 
412049 
425776 



NA 

M447453 Hs.27860 

AA687376 Hs.269533 

AA339449 Hs.82285 

AI860021 Hs.270651 

BE614743 Hs.146688 



SS,TM,3Beta_HSD, ENSP00000171555:CDNA FLJ10727 fis, clone 

SS,TM,7tm_1 , Homo sapiens mRNA; cDNA DKFZp586M0723 (f 

SS,pkinase,RhoGEF,ig,PH,SH3, ESTs 

AIRSJormyl_transf,GARS,SS,GARS,AIRS,forphosphoribosylglycinamideformyltransfer 



H26735 

AI345455 

AI910275 



Hs.91668 
Hs.78915 
Hs.1406 



AA863360 Hs.26040 
AW294092 Hs.21594 
NA 

AW296927 

AF181490 Hs.278627 
F13386 Hs.7888 
AL043004 Hs.79337 
R96696 Hs.35598 
NM_013257Hs.279696 
AW073310 Hs.163533 
NM_002916Hs.35120 
AI345227 Hs.105448 
AA829286 Hs.332053 
NM_001809Hs.1594 
U38945 Hs.1174 
S70284 

AL133916 Hs.172572 
Z97630 Hs.226117 
NM_005357Hs.95351 
NM_013989Hs.154424 
BE300512 Hs.193557 
U76456 Hs.190787 
L22524 Hs.2256 
M31158 Hs.77439 
AW452631 Hs.313803 
BE247550 Hs.86859 
M129640 Hs.128065 

AV653785 Hs.173334 
AA057264 Hs.238936 
M26380 Hs.1 80878 
X54942 Hs.83758 
AI041793 Hs.42502 
Y08565 Hs.151678 
AW301344 Hs.122908 
NM_014735Hs.82292 
AU076643 Hs.313 
Y13647 Hs.119597 
AW207175 Hs.106771 
NM_000909Hs.169266 
NM_003157Hs.1087 
NM_014400Hs.11950 
AI791493 Hs.129873 
AF059214 Hs.194687 
AI308876 Hs.103849 
AW895719 
NM_003816Hs.2442 
U29344 Hs.83190 
M383550 Hs.271699 
S57296 Hs.323910 
AW961400 Hs.333526 
AA121098 Hs.3838 
H22570 Hs.172572 
AA1 16021 Hs.38260 
AF039241 Hs.9028 
AB026436 Hs. 177534 
N53437 Hs.18268 
U25128 Hs.159499 



pkinase 
SS.TM.MAPEG, 

SS 1 TM,PH,SH2,Furin-like,pkinase,Recep_L 
pkinase.OPR, 

SS 1 trefoil,SS,TM,ldl_recepLa,SRCR,tryps 
SS.TM,p450, 

SS,tas,Y_phosphatase,ras 

SS,TM,PeptkJase_M1 > 
SS,pyr_redox,SS,Ribosomal_L39 



SS.pkinase, 
SS,TM,trypsin,vwd,ig 
pkinase,pkinase_C, 
pkinase, 



ESTs, Moderately similar to A47582 B-cel 
prostaglandin E synthase 
Homo sapiens clone PP1498 unknown mRNA 
GA-binding protein transcription factor, 
trefoil factor 1 (pS2) 

ESTs, Weakly similar to fatty acid omega 
hypothetical protein MGC15754 
NM_006265*:Homo sapiens RAD21 (S. pombe) 
gb:UI-H-BWO-ajc-c-07-0-Ul.s1 NCI_CGAP_Su 
prenylcysteine lyase 

Homo sapiens clone 23736 mRNA sequence 

KIAA0135 protein 

ESTs 

serum/glucocorticoid regulated kinase-li 
Homo sapiens cDNA FLJ14142 fis, clone MA 



SS,AAA,P!3_PI4_kinase,PI3Ka,PI3K_rbd,PI3 replication factor C (activator 1) 4 (37 
SS,TM,pkinase ESTs, Weakly similar to B34087 hypotheti 

SS,SAA_proteins,ABC_membraneABC_tran, serum amyloid A1 



SS.TM.thiolase, 
ank.ank 

SS.TM.Desaturase.SS 
SS,ig,pkinase,LRRNT,LRRCT, 
SS,TM,linker_histone,7tm_1 
SS,TM,p450, 

SS,T4_deiodinase,T4_deiodinase, 

SS,Y_phosphatase,Band_41 

SS.TIMP, 



centromere protein A (17kD) 
cyclin-dependent kinase inhibitor 2A (me 
gb:stearoyl-CoA desaturase [human, adipo 
hypothetical protein FLJ20093 
H1 histone family, member 0 
lipase, hormone-sensitive 
deiodinase, iodothyronine, type II 
ESTs, Moderately similar to ALU7_HUMAN A 
tissue inhibitor of metalloproteinase 4 



SS,Peptidase_M10,SS,Peptidase_IVI10,hemopematrix metalloproteinase 7 (MMP7; uterin 
SS,cNMP_binding,Rlla,HMG_box protein kinase, cAMP-dependent, regulato 

SS,abhydrolase ESTs, Highly similar to AF1 57833 1 none! 

SS,SH2,PH,SS,TM,PH,SH2,Furin-like,pkinas growth factor receptor-bound protein 7 



SS,Peptidase_C1,gpdh 
p450,SS,p450 

SS,TM,7tm_1, 
SS,Iipase,PLAT,Sec7,PH, 
CKS.SS.CKS, 
TM,7tm_1, 

Glycos_transf_2,Ricin_B_lectin,SS,Glycos 
SS,Pribosyltran,Sulfatase 
PHD.pkinase.SS 
SS,TM,efhand,ion_trans 
SS.TM.Desaturase.SS 
SS,7tm 1.SPRY, 
SS,TM,7tm_1, 

SS,pkinase,vwa,vwa,Glyco_transf_8 
SS,PH,lactamase_B 
SS,p450,p450 

cholesterol 25-hydroxylase 
TM,hemopexin,Peptidase_M10,hemopexin,Pephypothetical protein DKFZp761D112 
TM,ion_trans,K_tetra, gb:QV4-NN0039-290300-1 54-f06 NN0039 Homo 

a disintegrin and metalloproteinase doma 
Acyl_transf,adh_zinc,ketoacyl-synt,pp-bi fatty acid synthase 
IMS.SS polymerase (DNA directed) iota 

SS,TM,SH2,PH,pkinase,Recep_L_domain,Furiv-erb-b2 avian erythroblastic leukemia v 



ESTs 

C5000355:gi|4503225|ref|NP_000765.1 1 cyt 
ELL-RELATED RNA POLYMERASE II, ELONGATIO 
ESTs, Weakly similar to (defline not ava 
lipoprotein lipase 
CDC28 protein kinase 2 
ESTs 

UDP-N-acetyl-alpha-D-galactosamine:polyp 
DNA replication factor 
KIAA0215 gene product 
secreted phosphoprotein 1 (osteopontin, 
stearoyl-CoA desaturase (delta-9-desatur 
ESTs 

neuropeptide Y receptor Y1 
serine/threonine kinase 2 
GPI-anchored metastasis-associated prote 
ESTs, Weakly similar to A36036 cytochrom 



SS 

pkinase,POLO_box,SS,pkinase,POLO_box, 

SS,ig,pkinase,LRRNT,LRRCT, 

SS,UCH-1,UCH-2,SS,TM,G_glu_transpept 

Peptidase_M24, 

DSPc,Myosin_tail, 

SS.adenylatekinase, 

SS,TM,7tm_2,SS,TM,7tm_2 



HER2 receptor tyrosine kinase (c-erb-b2, 
serum-inducible kinase 
hypothetical protein FU20093 
ubiquitin specific protease 18 
histone deacetylase 5 
dual specificity phosphatase 10 
adenylate kinase 5 
parathyroid hormone receptor 2 



4.6 
4.6 
4.6 
4.6 
4.6 
4.5 
4.5 
4.5 
4.5 
4.4 
4.4 
4.4 
4.3 
4.3 
4.3 
4.3 
4.3 
4.2 
4.2 
4.2 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.0 
4.0 
4.0 
3.9 
3.9 
3.9 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.6 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.5 
3.4 
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407846 AA426202 Hs.40403 
406925 L34041 Hs.9739 
445873 AA250970 Hs.251946 
418054 NM_002318Hs.83354 
406815 AA833930 Hs.288036 
410530 M25809 Hs.64173 
407021 U52077 
421168 AF182277 Hs.330780 
431473 AA825686 Hs.321176 
408101 AW968504 Hs.123073 
422083 NM_001 141 Hs.1 11256 
411393 AW797437 Hs.69771 
435767 H73505 Hs.1 17874 
433068 NM_006456Hs.288215 
426928 AF037062 Hs.172914 
414575 H11257 
445941 AI267371 
444542 Al 161 293 
425741 AF052152 
434228 Z42047 
433264 D85782 
400419 AF084545 
439750 AL359053 
417757 R19897 
452194 AI694413 
421458 NM_003654Hs. 104576 
443767 BE562136 Hs.9736 
422648 D86983 
423431 AA326062 
451264 AI768235 
452110 T47667 
439963 AW247529 Hs.6793 
453941 U39817 Hs.36820 
406664 L34041 Hs.9739 
phosphate dehydrogenase 1 (so 
453487 R31770 Hs.23540 
420911 U77413 Hs.1 00293 
443171 BE281128 Hs.9030 
452256 AK000933 Hs.28661 
432201 AI538613 Hs.298241 
419150 T29618 
444443 AI149286 
426283 NM_003937Hs.1 69139 
436291 BE568452 Hs.5101 
450223 AA418204 Hs.241493 
424269 AW137691 Hs.199754 
448105 AW591433 Hs.298241 
452560 BE077084 Hs.336432 



TM,ABC_membrane,ABC_tran,Ribosomal_S4eCbp/p300-interacting transactivator, wit 
SS,TM,transporLprot,SWIB,RhoGAP,DAG_PE-glycerol-3-phosphate dehydrogenase 1 (so 



Hs.22968 

Hs.172636 

Hs.280380 

Hs.159412 

Hs.283978 

Hs.3229 

Hs.57664 

Hs.106604 

Hs.332649 



Hs.1 18893 



Hs.28005 



Hs.89640 
Hs.55099 



SS,rrm,PABP,pkinase,14-3-3,rrm 
SS.TM ,m ito_carr,Lysyl_oxidase 
SS.IPPT, 

ATP-synt_ab,SS,7tm_1 ,ATP-synt_ab 

SS,p450,SS 
SS 

pkinase, 

lipoxygenase.PLAT, 

SS,sushi,trypsin,vwa,rrm,fibrinogen_C,fn 

SS,Peptidase_S8,P,Peptidase_S8,P 

SS.Pribosyltran, 

SS,adh_short,TGF-beta,TGFb_propeptide 

SS.pkinase.ig, 

SS,SS,lipoxygenase,PLAT 

SS,SS,Peptidase_M1,EGF,ig,lectin_c,sushi 

pkinase, 

SS,TM,7tm_1 



poly(A)-binding protein, cytoplasmic 1-1 
lysyl oxidase-like 2 

tRNA isopentenylpyrophosphate transferas 
ATPase, H transporting, lysosomal (vacuo 
gb:Human marinerl transposase gene, comp 
cytochrome P450, subfamily I IB (phenobar 
ESTs, Weakly similar to S65824 reverse t 
CDC2-related protein kinase 7 
arachidonate 15-lipoxygenase, second typ 
B-factor, properdin 
ESTs 



SS,Peptidase_M1, 
TM,integrin_B,Ricin_BJectin,rrm 
death,ZU5,pkinase,Activin_recp, 
SS,TM,7tm_3,ANF_receptor,sushi 
SS 

SS,PCI,RasGEF,hormone_rec,zf-C4, 
peroxidase.LRRCT, 
SS,p450,p450 
SS.Trehalase 

SS,TM,Activin_recp, pkinase 
TM,p450,Ets 

SS,DEAD,HRDC,helicase_C, 
SS,TM,transporLprot,SWIB,RhoGAP,DAG_PE 
3.1 

TM,7tm_1, 



retinol dehydrogenase 5 (1 1-cis and 9-ci 
Homo sapiens clone IMAGE:451939, mRNA se 
ESTs 

aminopeptidase 

Homo sapiens clone 24628 mRNA sequence 
Homo sapiens PR02751 mRNA, complete cds 
cysteine dioxygenase, type I 
Target 

Homo sapiens mRNA full length insert cDN 
ESTs 

olfactory receptor, family 2, subfamily 
carbohydrate (keratan sulfate Gal-6) sul 
proteasome (prosome, macropain) 26S subu 
Melanoma associated gene 
gb:EST29171 Cerebellum II Homo sapiens c 
gb:wg82g08.x1 Soares_NSF_F8_9W_OT_PA_P_S 
Homo sapiens cDNA FU1 1309 fis, clone PL 
platelet-activating factor acetylhydrola 
Bloom syndrome 



SS,TM,7tm_1,rrm,SS 
TM,GDI,7tm_1, 

SS,TM,trypsin,SS,TM,trefoil,tjypsin,tref 
TM,pkinase,fn3, 
SS 

SS.abhydrolase, 
SS.proJsomerase, 
SS,TM,7tm_2,GPS 
SS,TM,trefoi1,trypsin,trefoil 
SS,rrm,zf-RanBP,pkinase,C2,pkinase_C,DAGESTs 



ESTs 

O-linked N-acetylglucosamine (GlcNAc) tr 
TONDU 

Homo sapiens cDNA FLJ 10071 fis, clone HE 
Transmembrane protease, serine 3 
TEK tyrosine kinase, endothelial (venous 
rab6 GTPase activating protein (GAP and 
kynureninase (L-kynurenine hydrolase) 
protein regulator of cytokinesis 1 
natural killer-tumor recognition sequenc 
ESTs 

Transmembrane protease, serine 3 



3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.4 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.1 

glycerol-3- 

3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.0 
3.0 



318 



Attorney Docket No.:018501-005210US 



TABLE 21A 

Table 21 A shows the accession numbers for those pkeys lacking unigenelD's for Table 21. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 





Pkey: 


Unique Eos probeset identifier number 




CAT number Gene cluster number 




Accession: 


Genbank accession numbers 




Pkey 


CAT number 


Accessions 




420854 


197072J 


AW296927 AI68451 4 AI2631 68 AA281 079 


J20 

ha.:} 


423431 


228162 1 


AA326062 M325758 AW962182 


423945 


233566 1 


AA410943 AW948953 AA334202 AA332882 


5 


451264 


863988J 


AI768235 R31400 H29082 H23107 




455325 


1279475 1 


AW895719N31451N41451 




456207 


165078_-1 


AA193450 
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TABLE 21B 



Table 21B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 21. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1 999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


401045 


8117619 


Plus 


90044-90184,91111-91345 


402230 


9966312 


Minus 


29782-29932 


402408 


9796239 


Minus 


110326-110491 


402578 


9884928 


Plus 


66350-66496 


403593 


6862650 


Minus 


62554-62712,69449-69602 


403943 


7711864 


Plus 


1 00742-100904, 1 01 322-1 01503 


404091 


7684554 


Minus 


82121-83229 
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TABLE 22: 739 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

Table 22 shows 739 genes up-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 19, except that the ratio was greater than or equal to 3.0, the 
denominator was the 85 th percentile value for 12 non-malignant breast specimens, and the 
96* percentile value amongst the 73 breast cancers was greater than or equal 100 units. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of 90 th percentile tumor to 85 1 " percentile normal breast tissue 



Pkey ExAccn UnigenelD UnigeneTitle R1 

400292 AA250737 Hs.72472 BMP-R1B 51.5 

424735 U31875 Hs.272499 short-chain alcohol dehydrogenase family 38.3 

400297 Al 127076 Hs.334473 hypothetical protein DKFZp56401 278 29.9 

431448 AL137517 Hs.334473 hypothetical protein DKFZp56401 278 26.9 

451110 AI955040 Hs.265398 ESTs, Weakly similar to transformation-r 25.8 

431211 M86849 Hs.323733 gap junction protein, beta 2, 26kD (conn 23.2 

418203 X54942 Hs.83758 CDC28 protein kinase 2 22.6 

407980 AA046309 gb:2f12f01.s1 Soares_fetal_heart_NbHH19W 19.8 

414646 AA353776 Hs.901 CD48 antigen (B-cell membrane protein) 18.9 

446921 AB012113 Hs.16530 small inducible cytokine subfamily A (Cy 18.0 

409041 AB033025 Hs.50081 KIAA1 199 protein 17.6 

412140 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines 17,6 

407824 AA147884 Hs.9812 Homo sapiens cDNA FLJ14388 fis, clone HE 17.1 

453160 AI263307 Hs.239884 H2B histone family, member L 17.0 

407137 T97307 gb:ye53h05.s1 Soares fetal liver spleen 16.1 

425692 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 16.1 

438533 AI440266 Hs.1 70673 ESTs, Weakly similar to T24832 hypotheti 16.0 

428227 AA321649 Hs.2248 small inducible cytokine subfamily B (CX 15.5 

444342 NM_014398Hs.10887 similar to lysosome-associated membrane 15.1 

422505 AL120862 Hs.124165 programmed cell death 9 (PDCD9) 14.9 

430515 AA746503 Hs.283313 ESTs 14.7 

417308 H60720 Hs.81892 KIAA01 01 gene product 14.4 

452744 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr14.4 

412446 AI768015 Hs.92127 ESTs 14.2 

415539 AI733881 Hs.72472 BMP-R1B 14.1 

435496 AW840171 Hs.265398 ESTs, Weakly similar to transformation-r 13.8 

438209 AL1 20659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 13.8 

400205 NA NM_006265*:Homo sapiens RAD21 (S. pombe)13.5 

430965 AA489732 Hs. 15491 8 ESTs 13.4 

415263 AA948033 Hs. 130853 ESTs 13.3 

451952 AL120173 Hs.301663 ESTs 13.2 

449722 BE280074 Hs.23960 cyclinBI 13.2 

406685 M18728 gb:Human nonspecific crossreacting antig 13.0 

406690 M29540 Hs.220529 carcinoembryonic antigen-related cell ad 12.8 

429925 NM_000786Hs.226213 cytochrome P450, 51 (lanosterol 14-alpha 12.8 

416498 U33632 Hs.79351 potassium channel, subfamily K, member 1 12.7 

432378 AI493046 Hs.146133 ESTs 12.5 

441377 BE218239 Hs.202656 ESTs 12.5 

456207 AA1 93450 gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapil 2.4 

422805 AA436989 Hs.121017 H2A histone family, member A 12.2 

40781 1 AW190902 Hs.40098 cysteine knot superfamily 1 , BMP antagon 12.2 

407178 AA195651 Hs.104106 ESTs 12.2 
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10 



15 



So 

1 fi 



©0 

Ss 

40 
45 

50 
55 
60 
65 



420931 
421727 
434408 
446591 
431385 
443348 
416602 
433365 
437866 
412472 
416030 
439979 
420757 
411598 
423600 
430770 
421037 
452461 
409269 
417791 
447268 
424001 
447342 
424905 
453619 
442942 
434377 
427217 
445730 
432887 
452243 
424590 
432169 
438950 
418836 
430291 
444665 
407377 
445413 
443462 
442145 
435570 
439820 
428966 
449448 
433929 
432731 
411815 
415385 
422026 
432596 
439451 
423945 
442432 
446715 
408771 
437021 
428479 
428839 
402408 
418601 
426327 
419519 
440621 
446142 
418196 
447178 



AF044197 

Y13153 

AI031771 

H44186 

BE178536 



Hs.100431 
Hs.107318 
Hs.1 32586 
Hs.15456 
Hs.11090 



AW873596 Hs.182278 
NM_006159Hs.79389 
AF026944 Hs.293797 
AA156781 Hs.74170 
AW975398 Hs.293836 
H15261 Hs.21948 
AW600291 Hs.6823 
X78592 Hs.99915 
BE336654 Hs.70937 



AI633559 

AA765694 

AI684808 

N78223 

M576953 



Hs.310359 
Hs.123296 
Hs.197653 
Hs.108106 
Hs.22972 



AW965339 Hs.1 11471 
AI370413 Hs.36563 
W67883 Hs.137476 
AI199268 Hs.19322 
NM_002497Hs.153704 
H87648 Hs.33922 
AW167087 Hs.131562 
AW1 37148 Hs.306593 
AA399272 Hs.144341 
AI624342 Hs.1 70042 
AI926047 Hs.1 62859 
AL355715 Hs.28555 
AW966399 Hs.46821 
Y00971 Hs.2910 
H23789 Hs.144530 
AI655499 Hs.161712 



AV660345 
BE613126 
C16391 
AA151342 
AI064690 
AI022650 
AF212222 
AL360204 
AF059214 
D60730 
AI375499 
R31178 
AA156679 
R17798 
U80736 
AJ224741 
AF086270 
AA410943 
BE093589 
AI337735 



Hs.238126 
Hs.47783 

Hs.12677 
Hs.171176 
Hs.8117 
Hs.1 7781 2 
Hs.283853 
Hs.194687 
Hs.57471 
Hs.27379 
Hs.287820 
Hs.1 25790 
Hs.7535 
Hs.1 10826 
Hs.278461 
Hs.278554 

Hs.38178 
Hs.1 7391 9 
AW732573 Hs.47584 
AI076089 Hs.292239 



small inducible cytokine B subfamily {Cy 12.1 

kynurenine 3-monooxygenase (kynurenine 3 12.0 

ESTs 12.0 

PDZ domain containing 1 1 1 .9 

membrane-spanning 4-domains, subfamily A 1 1 .8 

calmodulin 2 (phosphorylase kinase, delt 1 1 .7 

nel (chicken)-like 2 11.7 

ESTs 11.6 

metallothionein 1E (functional) 11.5 

ESTs 11.4 

ESTs 11.3 

hypothetical protein FU 1 0430 1 1 .3 

androgen receptor (dihydrotestosterone r 1 1 .3 

H3 histone family, member A 11.2 

ESTs 11.2 

ESTs 11.0 

programmed cell death 9 (PDCD9) 10.9 

transcription factor 10.7 

hypothetical protein FLJ 1 3352 1 0.6 

ESTs 10.6 

hypothetical protein FLJ22418 10.4 

paternally expressed 10 10.4 

Homo sapiens, Similar to RIKEN cDNA 2010 10.4 

NIMA (never in mitosis gene a)-related k 10.1 
Homo sapiens, clone MGC:9084, mRNA, comp10.1 

ESTs 10.1 

Homo sapiens cDNA FU1 1382 fis, clone HE 10.1 

ESTs 10.1 

ESTs 10.0 

ESTs 10.0 

programmed cell death 9 9.9 

hypothetical protein FLJ20086 9.9 

phosphoribosyl pyrophosphate synthetase 9.9 

EST 9.9 

ESTs 9.8 

CGI-49 protein 9.8 

B aggressive lymphoma gene 9.7 



gb:C16391 Clontech human aorta polyA mRN 9.7 
CGI-147 protein 9.7 
ESTs 9.7 
erbb2-interacBng protein ERBIN 9.7 
uncharacterized bone marrow protein BM04 9.7 
Homo sapiens mRNA full length insert cDN 9.6 
cholesterol 25-hydroxylase 9.6 
ESTs 9.6 
ESTs 9.5 
fibranect'n 1 9.3 
leucine-rich repeat-containing 2 9.3 
COBW-like protein 9.3 
trinucleotide repeat containing 9 9.2 
matrilin 3 9.2 
heterochromatin-like protein 1 9.2 
gb:zt32h03.rl Soares ovary tumor NbHOT H 9.1 
hypothetical protein FLJ23468 9.1 
ESTs, Moderately similar to ZN91_HUMAN Z 9.0 
potassium voltage-gated channel, delayed 9.0 
ESTs 9.0 
Y00272 Hs.184572 cell division cycle 2, G1 to Sand G2 to 8.9 
AI767756 Hs.82302 Homo sapiens CDNAFLJ14814 fis, clone NT 8.9 
NA NM_030920*:Homo sapiens hypothetical pro 8.9 

AA279490 Hs.86368 calmegin 8.8 
W03242 Hs.44898 Homo sapiens clone TCCCTA001 51 mRNA sequ8.f 
AI198719 Hs.1 76376 ESTs 8.8 
AW296024 Hs.1 50434 ESTs 8.8 
AI754693 Hs.145968 ESTs 8.8 
AI745649 Hs.26549 KIAA1 708 protein 8.7 
AW594641 Hs.1 92417 ESTs 8.7 
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427585 D31 152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 8.6 

415857 AA866115 Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE 8.6 

435061 AI651474 Hs.163944 ESTs 8.6 

431374 BE258532 Hs.251871 CTP synthase 8.4 

5 417866 AW067903 Hs.82772 collagen, type XI, alpha 1 8.4 

437211 AA382207 Hs.5509 ecotropic viral integration site 2B 8.3 

437751 AA767373 Hs.35669 ESTs, Moderately similar to ALU1_HUMAN A 8.3 

423887 AL080207 Hs.134585 DKF2P434G232 protein 8.2 

440941 BE268362 Hs.7535 COBW-like protein 8.2 

10 429859 NM_007050Hs.225952 protein tyrosine phosphatase, receptor t 8.2 

410193 AJ132592 Hs.59757 zinc finger protein 281 8.2 

431725 X65724 Hs.2839 Norrie disease (pseudoglioma) 8.1 

446258 AI283476 Hs.263478 ESTs 8.1 

416747 AW876523 Hs.15929 hypothetical protein FLJ12910 8.1 

15 434424 AI811202 Hs.325335 Homo sapiens cDNA: FU23523 fis, clone L 8.1 

421650 AA781795 Hs.122587 ESTs 8.0 

s = 429534 AW976987 Hs.163327 ESTs, Weakly similarto2109260AB cell 8.0 

457465 AW301 344 Hs.1 22908 DNA replication factor 8.0 

Q 427961 AW293165 Hs.143134 ESTs 8.0 

f£0 436481 AA379597 Hs.5199 HSPC1 50 protein similar to ubiquitin-con 8.0 

7Z\ 418216 AA662240 Hs.283099 AF15q14 protein 8.0 

V 1 418250 U29926 Hs.83918 adenosine monophosphate deaminase (isofo 7.9 

ffii 400285 NA Eos Control 7.9 

s ? ; 401464 AF039241 Hs.9028 histone deacetylase 5 7.9 

:25 407242 M18728 gb:Human nonspecific crossreacting antig 7.8 

Ni 422232 D43945 Hs. 113274 transcription factor EC 7.8 

f": 454024 AA993527 Hs.293907 hypothetical protein FLJ23403 7.8 

444542 AI161293 Hs.280380 aminopeptidase 7.8 

436396 AI683487 Hs.152213 wingless-type MMTV integration site fami 7.7 

GO 437204 AL110216 Hs.12285 ESTs, Weakly similar to 15521 4 salivary 7.6 

y ; 408805 H69912 Hs.48269 vaccinia related kinase 1 7.6 

L: 437207 T27503 Hs.15929 hypothetical protein FLJ12910 7.6 

'i! 442818 AK001741 Hs.8739 hypothetical protein FLJ 10879 7.6 

4=. ; 426283 NM_003937Hs.169139 kynureninase (L-kynurenine hydrolase) 7.5 

r35 424687 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 7.5 

_! 446315 NM_016293Hs. 14770 bridging integrator 2 7.5 

W 433426 H69125 Hs.133525 ESTs 7.5 

406639 M97711 gb:Human T-cell receptor (V beta 18.1, J 7.5 

420077 AW512260 Hs.87767 ESTs 7.4 

40 457332 AA961694 Hs. 1051 87 kinesin protein 9 gene 7.4 

422938 NM_001809Hs.1594 centromere protein A (1 7kD) 7.4 
447555 AI391662 Hs.160963 Homo sapiens, clone MGC:12318, mRNA, com7.4 
444618 AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE II, ELONGATI07.3 

410361 BE391804 Hs.62661 guanylate binding protein 1, interferon- 7.3 
45 400268 NA NM_003292:Homo sapiens translocated prom 7.3 

439509 AF086332 Hs.58314 ESTs 7.3 

407771 AL138272 Hs.62713 ESTs 7.3 

407202 N58172 Hs.1 09370 ESTs 7.3 

433096 AU076803 Hs.282975 carboxylesterase 2 (intestine, liver) 7.2 

50 422094 AF129535 Hs.272027 F-box only protein 5 7.1 

430832 AI073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 7.1 
430287 AW182459 Hs.125759 ESTs, Weakly similar to LEU5_HUMAN LEUKE7.0 

423739 AA398155 Hs.97600 ESTs 7.0 
448212 AI475858 gb:tc87d07.x1 NCI_CGAP_CLL1 Homo sapiens7.0 

55 407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 7.0 

454440 BE062906 Hs.28338 KIAA1 546 protein 7.0 

444783 AK001468 Hs.62180 anillin (Drosophila Scraps homolog), act 7.0 

421373 AA808229 Hs.167771 ESTs 6.9 

431960 AW241821 Hs.301927 c6.1A 6.9 

60 424704 AI263293 Hs.152096 cytochrome P450, subfamily IIJ (arachido 6.8 

449517 AW500106 Hs.23643 serine/threonine protein kinase MASK 6.8 

439840 AW449211 Hs.1 05445 GDNF family receptor alpha 1 6.8 

414080 AA135257 Hs.47783 B aggressive lymphoma gene 6.8 

441243 AI767056 Hs.193002 ESTs 6.7 

65 408380 AF123050 Hs.44532 diubiquitin 6.7 

422956 BE545072 Hs.122579 hypothetical protein FLJ10461 6.7 

446651 AA393907 Hs.97179 ESTs 6.7 
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419839 U24577 Hs.93304 phospholipase A2, group VII (platelet-ac 6.7 

437740 AA810265 Hs.122915 ESTs 6.7 

421582 AI910275 Hs.1406 trefoil factor 1 (pS2) 6.7 

427356 AW023482 Hs.97849 ESTs 6.6 

429597 NM_003816Hs.2442 a disintegrin and metalloproteinase doma 6.6 

422634 NM_016010Hs.1 18821 CGI-62 protein 6.6 

421072 AI215069 Hs.89113 ESTs 6.5 

427718 AI798680 Hs.25933 ESTs 6.5 

411000 N40449 Hs.201619 ESTs, Weakly similar to S38383SEB4B pro 6.5 

449343 AI151418 Hs.272458 protein phosphatase 3 (formerly 2B), cat 6.4 

409757 NM_001898Hs.123114 cystatinSN 6.4 

447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA, partial sequence 6.4 

456938 X52509 Hs.161640 tyrosine aminotransferase 6.4 

418848 AI820961 Hs. 193465 ESTs 6.4 

424902 NM_003866Hs.153687 inositol polyphosphate-4-phosphatase, ty 6.4 

452838 U65011 Hs.30743 preferentially expressed antigen in mela 6.4 

439452 AA918317 Hs.57987 B-cell CLL/lymphoma 11B (zinc finger pro 6.4 

407266 AJ235664 gb:Homo sapiens mRNA for immunoglobulin 6.3 

411078 AI222020 Hs. 182364 CocoaCrisp 6.3 

433001 AF217513 Hs.279905 clone HQ0310 PRO0310p1 6.3 

434340 AI193043 Hs.128685 ESTs, Weakly similar to T1 7226 hypotheti 6.2 

429503 M394183 Hs.26873 ESTs 6.2 

402578 C1001134:gi|2117372|pir||l65981 fatty ac 6.2 

409646 AW161391 Hs.709 deoxycytidine kinase 6.1 

430447 W17064 Hs.332848 SWI/SNF related, matrix associated, acti 6.1 

432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 6.1 

443709 AI082692 Hs.134662 ESTs 6.1 

420929 AI694143 Hs.296251 programmed cell death 4 6.1 

428368 BE440042 Hs.83326 matrix metalloproteinase 3 (stromelysin 6.1 

428248 AI126772 Hs.40479 ESTs 6.0 

420344 BE463721 Hs.97101 putative G protein-coupled receptor 6.0 

453392 U23752 Hs.32964 SRY (sex determining region Y)-box 11 6.0 

425397 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 6.0 

418007 M1 3509 Hs.83169 matrix metalbproteinase 1 (MMP1; inters 6.0 

428585 AB007863 Hs.1 85140 KIAA0403 protein 6.0 

437608 M761605 Hs.292308 ESTs, Weakly similartoALU1_HUMAN ALU S6.0 

427408 AA583206 Hs.2156 RAR-related orphan receptor A 6.0 

406687 M31 126 Hs.272620 matrix metalloproteinase 11 (MMP11;stro 6.0 

418092 R45154 Hs.106604 ESTs 6.0 

447051 AW139130 Hs.160951 ESTs, Weakly similartoConl [H.sapiens] 6.0 

441233 AA972965 Hs.135568 ESTs 6.0 

432239 X81334 Hs.2936 matrix metalloproteinase 13 (collagenase 6.0 

435106 AA100847 Hs.193380 ESTs, Highly similar to AF1 74600 1 F-box 5.9 

435525 AI831297 Hs.123310 ESTs 5.9 

458809 AW972512 Hs.20985 sin3-associated polypeptide, 30kD 5.9 
410785 AW803341 gb:IL2-UM0079-090300-050-D03 UM0079 Homo5.9 

422576 BE548555 Hs.1 18554 CGI-83 protein 5.9 

451398 AI793124 Hs.144479 ESTs 5.9 

441881 AW968904 Hs.179566 hypothetical protein FU22624 5.8 

412022 AI005043 Hs.24143 Wiskott-Aldrich syndrome protein interac 5.8 

416636 N32536 Hs.42645 solute carrier family 16 (monocarboxylic 5.8 

447350 AI375572 Hs.172634 ESTs 5.8 

434094 M305599 Hs.238205 hypothetical protein PRO2013 5.8 

409151 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. c 5.8 

448807 AI571940 Hs.7549 ESTs 5.8 

452281 T93500 Hs.28792 Homo sapiens cDNA FU1 1041 fis, clone PL 5.8 

421281 AI299139 Hs.17517 ESTs 5.8 

430361 AI033965 Hs.239926 sterol-C4-methyl oxidase-like 5.8 

400289 X07820 Hs.2258 matrix metalloproteinase 10 (MMP10; str 5.7 

440527 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha 5.7 

434674 AA831879 Hs.136985 ESTs 5.7 

426320 W47595 Hs.169300 transforming growth factor, beta 2 5.7 

452401 NM_007115Hs.29352 tumor necrosis factor, alpha-induced pro 5.7 

448663 BE614599 Hs.106823 hypothetical protein MGC14797 5.7 

438199 AW016531 Hs.122147 ESTs 5.7 

446203 Z47553 Hs.14286 flavin containing monooxygenase 5 5.7 

428336 AA503115 Hs.183752 microseminoprotein, beta- 5.6 
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430379 AF134149 Hs.240395 potassium channel, subfamily K, member 6 5.6 

422835 BE218705 Hs.121378 metallothionein-like 5, testis-specific 5.6 

444758 AL044878 Hs.11899 3-hydroxy-3-methykjlutaryl-CoenzymeA re 5.6 

443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 5.6 

400301 X03635 Hs.1657 estrogen receptor 1 5.6 

447078 AW885727 Hs.301570 ESTs 5.6 

432015 AL157504 Hs.159115 Homo sapiens mRNA; cDNA DKFZp586O0724 (f5.5 

438691 AA906288 Hs.212184 ESTs 5.5 

439809 R41396 Hs.101774 hypothetical protein FLJ23045 5.5 

415786 AW419196 Hs.257924 hypothetical protein FU 13782 5.5 

456373 BE247706 Hs.89751 membrane-spanning 4-domains, subfamily A 5.5 

401645 NA C16001440*:gi|12330704|gb|AAG52890.1|AF35.5 

437967 BE277414 Hs.5947 mel transforming oncogene (derived from 5.5 

445885 AI734009 Hs.127699 KIAA1 603 protein 5.4 

439138 AI742605 Hs.193696 ESTs 5.4 

440270 NM_015986Hs.7120 cytokine receptor-like molecule 9 5.4 

437536 X91221 Hs.144465 ESTs 5.4 

438167 R28363 Hs.24286 ESTs 5.4 

452741 BE392914 Hs.30503 Homo sapiens cDNAFLJ1 1344 fis, clone PL 5.4 

426214 H59846 Hs.128355 ESTs, Moderately similar to ALU7_HUMAN A 5.4 

413554 AA319146 Hs.75426 secretogranin II (chromogranin C) 5.4 

422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.4 

434263 N34895 Hs.44648 ESTs 5.4 

446382 AW205168 Hs.150823 ESTs 5.4 

422406 AF025441 Hs.1 16206 Opa-interacting protein 5 5.3 

438321 AA576635 Hs.6153 CGI-48 protein 5.3 

418310 AA814100 Hs.86693 ESTs 5.3 

419625 U91616 Hs.91640 nudear factor of kappa light polypeptid 5.3 

450701 H39960 Hs.288467 Homo sapiens cDNA FLJ 12280 fis, clone MA 5.3 

445900 AF070526 Hs.13429 Homo sapiens clone 24787 mRNA sequence 5.2 

449051 AW961 400 Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 5.2 

418758 AW959311 Hs.172012 hypothetical protein DKFZp434J037 5.2 

431070 AW408164 Hs.249184 transcription factor 19 (SC1) 5.2 

417079 U65590 Hs.81134 interleukin 1 receptor antagonist 5.2 

421928 AF013758 Hs.109643 polyadenylate binding protein-interactin 5.2 

428804 AK000713 Hs.193736 hypothetical protein FU20706 5.2 

427427 AF077345 Hs.177936 ESTs 5.2 

403485 C3001813*:gi|12737279|ref|XP_012163.1|k 5.2 

422168 AA586894 Hs.1 12408 S100 calcium-binding protein A7 (psorias 5.1 

421937 AI878857 Hs.109706 hematological and neurological expressed 5.1 

426752 X69490 Hs.172004 titJn 5.1 

453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.1 

423198 M81933 Hs.1634 cell division cycle 25A 5.1 

412281 AI810054 Hs.14119 ESTs 5.1 

447513 AW955776 Hs.313500 ESTs, Moderately similar to ALU7_HUMAN A 5.1 

453931 AL121278 Hs.25144 ESTs 5.1 

404347 Target Exon 5.1 

431808 M30703 Hs.270833 amphiregulin (schwannoma-derived growth 5.1 

429113 D28235 Hs.196384 prostaglandin-endoperoxide synthase 2 (p 5.1 

436291 BE568452 Hs.5101 protein regulator of cytokinesis 1 5.1 

450603 R43646 Hs.12422 ESTs 5.1 

434725 AK000796 Hs.4104 hypothetical protein 5.0 

435981 H74319 Hs.1 88620 ESTs 5.0 

407376 AA993138 Hs.142287 ESTs, Weakly similar to ALUF_HUMAN !!!! 5.0 

431689 AA305688 Hs.267695 UDP-Gal:betaGlcNAc beta 1,3-galactosyltr 5.0 

405348 NA C7001664:gi|12698061|dbj|BAB21849.1| (AB 5.0 

436196 AK001084 Hs.333498 Homo sapiens cDNA FLJ10222 fis, clone HE 5.0 

437065 AL036450 Hs.1 03238 ESTs 5.0 

410196 AI936442 Hs.59838 hypothetical protein FLJ10808 5.0 

429412 NM_006235Hs.2407 POU domain, class 2, associating factor 5.0 

446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 4.9 

403329 NA TaigetExon 4.9 

442875 BE623003 Hs.23625 Homo sapiens clone TCCCTA001 42 mRNA sequ4.9 

442441 AI820662 Hs.1 29598 ESTs 4.9 

430375 AW371048 Hs.93758 H4 histone family, member H 4.9 

424128 AW966163 gb:EST378236 MAGE resequences, MAGI Homo4.9 

408873 AL046017 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 4.9 
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407910 AA650274 Hs.41296 fibronectin leucine rich transmembrane p 4.9 

432606 NM_002104Hs.3066 granzyme K (serine protease, granzyme 3; 4.9 

453204 R10799 Hs.191990 ESTs 4.8 
452020 AA722012 Hs.255757 ESTs, Weakly similar to AT2A.HUMAN POTEN4 .8 

449048 Z45051 Hs.22920 similar to S68401 (cattle) glucose indue 4.8 

408369 R38438 Hs.182575 solute carrier family 15 (H??? transport 4.8 

431645 AF078849 Hs.266483 dynein light chain-A 4.8 

423575 C18863 Hs.163443 Homo sapiens cDNAFLJ11576fis, clone HE 4.8 

444246 H93281 Hs.10710 hypothetical protein FLJ20417 4.8 

421524 AA312082 Hs.105445 GDNF family receptor alpha 1 4.8 

452827 AI571835 Hs.55468 ESTs 4.8 

414222 AL135173 Hs.878 sorbitol dehydrogenase 4.8 

456086 AL161999 Hs.77324 eukaryotic translation termination facto 4.8 

419078 M93119 Hs.89584 insulinoma-associated 1 4.8 

418973 AA233056 Hs.191518 ESTs 4.8 

447033 AI357412 Hs.157601 ESTs 4.8 

451621 AI879148 Hs.26770 fatty acid binding protein 7, brain 4.7 

419968 X04430 Hs.93913 interieukin 6 (interferon, beta 2) 4.7 

424326 NM_014479Hs.145296 disintegrin protease 4.7 

431585 BE242803 Hs.262823 hypothetical protein FLJ 10326 4.7 

429294 AA095971 Hs.198793 Homo sapiens cDNA: FLJ22463 fis, clone H 4.7 

416814 AW192307 Hs.80042 dolichyl-P-Glc:Man9GlcNAc2-PP-dolichylgl 4.7 

439897 NM_015310Hs.6763 KIAA0942 protein 4.7 

429687 AI675749 Hs.211608 nucleoporin 153kD 4.7 

422880 AF228704 Hs.121524 glutathione reductase 4.7 
405801 NM_000390:Homo sapiens choroideremia (Ra4.6 

432435 BE218886 Hs.282070 ESTs 4.6 

439544 W26354 Hs.28891 hypothetical protein FLJ 11 360; artemis p 4.6 

425354 U62027 Hs.155935 complement component 3a receptor 1 4.6 

436027 AI864053 Hs.39972 ESTs, Weakly similar to I38588 reverse t 4.6 

424623 AW963062 Hs.337404 ESTs 4.6 

403366 NA Target Exon 4.6 

402542 Target Exon 4.6 

450193 AI916071 Hs.15607 Homo sapiens Fanconi anemia complementat 4.6 

411678 AI907114 Hs.71465 squalene epoxidase 4.6 

456844 AI264155 Hs.152981 CDP-diacylglycerol synthase (phosphatida 4.6 

448072 AI459306 Hs.24908 ESTs 4.5 

408045 AW138959 Hs.245123 ESTs 4.5 

423782 AI472209 Hs.323117 ESTs 4.5 
447388 AW630534 Hs.76277 Homo sapiens, clone MGC:9381 , mRNA, comp4.5 

448140 AF146761 Hs.20450 BCM-like membrane protein precursor 4.5 

452561 AI692181 Hs.49169 KIAA1 634 protein 4.5 
425331 AW962128 gb:EST374201 MAGE resequences, MAGG Homo4.5 

428801 AW277121 Hs.254881 ESTs 4.5 

428500 AI815395 Hs.184641 fatty acid desaturase 2 4.5 

426075 AW513691 Hs.270149 ESTs, Weakly similar to 21 09260AB cell 4.4 

437259 AI377755 Hs. 120695 ESTs 4.4 

400409 AF153341 Hs.283954 Homo sapiens winged helix/forkhead trans 4.4 

412863 AA121673 Hs.59757 zinc finger protein 281 4.4 

426989 AI815206 Hs.99395 ESTs 4.4 

401866 Target Exon 4.4 

418819 AA228776 Hs.191721 ESTs 4.4 

406348 Target Exon 4.4 

412138 AW895387 gb:QV4-NN0038-300300-157-c10NN0038Homo4.4 

428550 AW297880 Hs.98661 ESTs 4.4 
411743 AW862214 gb:QV4-CT0361-301299-074-b05CT0361 Homo4.4 

429966 BE081342 Hs.283037 HSPC039 protein 4.4 

423291 NM_004129Hs.126590 guanylate cyclase 1, soluble, beta 2 4.4 

423456 AL110151 Hs.128797 DKFZP586D0824 protein 4.4 
452190 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA4.4 

424871 NM_004525Hs.153595 low density lipoprotein-related protein 4.3 

429575 AA706003 Hs.99387 ESTs 4.3 

429922 Z97630 Hs.2261 17 H1 histone family, member 0 4.3 

421379 Y15221 Hs.103982 small inducible cytokine subfamily B (Cy 4.3 

400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 4.3 

437258 AL041243 Hs.174104 ESTs 4.3 

446595 T57448 Hs.15467 hypothetical protein FLJ20725 4.3 
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403011 NA ENSP00000215330*:Probableserine/threoni 4.3 

419055 AI365384 Hs.11571 Homo sapiens cDNAFLJ1 1570 fis, clone HE 4.3 

418661 NM_001949Hs.1189 E2F transcription factor 3 4.3 

407786 AA687538 Hs.38972 tetraspan 1 4.3 

5 429183 AB014604 Hs. 197955 KIAA0704 protein 4.3 

442914 AW188551 Hs.99519 hypothetical protein FLJ14007 4.3 

441029 AI091795 Hs.179246 ESTs 4.3 

452194 AI694413 Hs.332649 olfactory receptor, family 2, subfamily 4.3 

414821 M63835 Hs.77424 Fc fragment of IgG, high affinity la, re 4.2 

10 410102 AW248508 Hs.279727 Homo sapiens cDNA FLJ14035 fis, done HE 4.2 

452110 T47667 Hs.28005 Homo sapiens cDNAFLH 1309 fis, clone PL 4.2 

442007 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 4.2 

417318 AW953937 Hs.12891 ESTs 4.2 

431 81 8 AW510444 Hs. 1 91 705 ESTs, Weakly similar to T471 84 hypotheti 4.2 

15 443646 AI085198 Hs.164226 ESTs 4.2 

419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 4.2 

446839 BE091926 Hs. 16244 mitotic spindle coiled-coil related prat 4.2 

H' 423242 AL039402 Hs.125783 DEME-6 protein 4.2 

==; 432116 AA902953 Hs.308538 ESTs 4.2 

;.20 409038 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 4.2 

445625 BE246743 Hs.288529 hypothetical protein FLJ22635 4.2 

111 425139 AW630488 Hs.325820 protease, serine, 23 4.2 

ff-, 447397 BE247676 Hs.18442 E-1 enzyme 4.2 

!?: 410166 AK001376 Hs.59346 hypothetical protein FLJ10514 4.1 

IU5 437295 AW779318 Hs.88417 ESTs 4.1 

%..\ 430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.1 

==: 441790 AW294909 Hs.132208 ESTs 4.1 

410129 BE244074 Hs.58831 regulator of Fas-induced apoptosis 4.1 

s 427521 AW973352 Hs.290585 ESTs 4.1 

f30 425247 NM_005940Hs.1 55324 matrix metalloproteinase 11 (MMP11; stro 4.1 

='"' 412886 AF041163 Hs.74647 Human T-cell receptor active alpha-chain 4.1 

^ 441153 BE562826 gb:601336534F1 NIH_MGC_44 Homo sapiens c4.1 

pj 444301 AK000136 Hs.10760 asporin (LRR class 1) 4.1 

426711 AA383471 Hs.1 80669 conserved gene amplified in osteosarcoma 4.1 

235 405850 NA Target Exon 4.1 

O 440283 AI732892 Hs.1 90489 ESTs 4.0 

fj i 432441 AW292425 Hs.1 63484 ESTs 4.0 

400284 NA estrogen receptor 1 4.0 

417341 N91453 Hs.1 02987 ESTs 4.0 

40 429732 U20158 Hs.2488 lymphocyte cytosolic protein 2 (SH2 doma 4.0 

411393 AW797437 Hs.69771 B-factor, properdin 4.0 

425704 U79293 Hs.1 59264 Human clone 23948 mRNA sequence 4.0 

419594 AA013051 Hs.91417 topoisomerase (DNA) II binding protein 4.0 

419092 J05581 Hs.89603 mucin 1, transmembrane 4.0 

45 443147 AI034351 Hs.19030 ESTs 4.0 

408633 AW963372 Hs.46677 PRO2000 protein 4.0 

433404 T32982 Hs.1 02720 ESTs 4.0 

421506 BE302796 Hs.105097 thymidine kinase 1 , soluble 4.0 

417900 BE250127 Hs.82906 CDC20 (cell division cycle 20, S. cerevi 3.9 

50 414602 AW630088 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f3.9 

413762 AW411479 Hs.848 FK506-binding protein 4 (59kD) 3.9 

404580 NM_014112*:Homosapienstrichorhinophala 3.9 

452046 AB018345 Hs.27657 KIAA0802 protein 3.9 

459587 AA031956 gb:zk15e04.s1 Soares_pregnant_uterus_NbH 3.9 

55 416658 U03272 Hs.79432 fibrillin 2 (congenital contractu ral ara 3.9 

426647 AA243464 Hs.294101 pre-B-cell leukemia transcription factor 3.9 

429353 AL117406 Hs.200102 ATP-binding cassette transporter MRP8 3.9 

419038 AW134924 Hs.190325 ESTs 3.9 

418918 X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell 3.9 

60 421977 W94197 Hs.110165 ribosomal protein L26 homolog 3.9 

442567 AI201183 Hs.1 30251 ESTs 3.9 

421168 AF182277 Hs.330780 cytochrome P450, subfamily IIB (phenobar 3.9 

431701 AW935490 Hs.14658 Human chromosome 5q13.1 clone 5G8mRNA3.9 

418526 BE019020 Hs.85838 solute carrier family 16 (monocarboxylic 3.9 

65 414998 NM_002543Hs.77729 oxidised low density lipoprotein (lectin 3.9 

422790 AA809875 Hs.25933 ESTs 3.9 

419741 NM_007019Hs.93002 ubiquitin carrier protein E2-C 3.9 
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430017 


AA263172 Hs.35 protein tyrosine phosphatase, non-recept 


3.9 




458814 


AI498957 Hs.170861 ESTs, Weakly similar to Z1 95 HUMAN ZINC 


3.8 




428514 


AW236861 Hs. 1931 39 ESTs 


3.8 




434521 


NM_002267Hs.3886 karyopherin alpha 3 (importin alpha 4) 


3.8 


5 


409425 


U40462 Hs.54452 zinc finger protein, subfamily 1 A, 1 (Ik 


3.8 




439560 


BE565647 Hs.74899 hypothetical protein FLJ12820 


3.8 




424028 


AF055084 Hs.153692 Homo sapiens cDNAFLJ14354fis, clone Y7 


3.8 




400021 


AFFX control - HUMISGF3A/M97935_MA 


3.8 


10 


453403 


BE466639 Hs.61 779 Homo sapiens cDNA FLJ 1 3591 fis, clone PL 


3.8 


445941 


AI267371 Hs.172636 ESTs 


3.8 




434378 


AA631739 Hs.335440 EST 


3.8 




429220 


AW207206 Hs.136319 ESTs 


3.8 




439176 


AI446444 Hs.1 90394 ESTs, Weakly similar to B28096 line-1 pr 


3.8 


15 


401045 


C1 1 001 883*:gi|6753278|reflNP_033938.1 1 c 


3.8 


430178 


AW449612 Hs.152475 ESTs 


3.8 




423397 


NM_001838Hs.1 652 chemokine (C-C motif) receptor 7 


3.8 




447630 


AI660149 Hs.44865 lymphoid enhancer-binding factor 1 


3.8 




436391 


AJ227892 Hs.1 46274 ESTs 


3.8 




413011 


AW068115 Hs.821 biglycan 


3.8 


[20 


422121 


AI767949 Hs.1 79833 ESTs 


3.8 


s 5 K '5 


452268 


NM_00351 2HS.28777 H2A histone family, member L 


3.8 


; 

Ki 


427811 


M81057 Hs.1 80884 carboxypeptidase B1 (tissue) 


3.8 


415579 


AA165232 Hs.222069 ESTs 


3.8 


== :■ 


437330 


AL353944 Hs.50115 Homo sapiens mRNA;cDNADKFZp761J11 12 (f 3.8 




427122 


AW057736 Hs.323910 HER2 receptor tyrosine kinase (oerb-b2, 


3.7 




400286 


NA C16000922:gi|7499103|pir||T20903 hypothe 


3.7 




420281 


AI623693 Hs.191533 ESTs 


3.7 


£ 


419926 


AW900992 Hs.93796 DKFZP586D2223 protein 


3.7 




417541 


AI992191 Hs.180040 hypothetical protein FLJ22439 


3.7 


Hpo 


426172 


AA371307 Hs.125056 ESTs 


3.7 


r~ 




AI916662 Hs.211577 kinectin 1 (kinesin receptor) 


3.7 




457001 


J03258 Hs.2062 vitamin D (1 ,25- dihydroxyvitamin D3) re 


3.7 


11= i 


424109 


AW406878 gb:UI-HF-BL0-adg-g-06-0-Ul.r1 NIH_MGC_373.7 


„jr"* 


417022 


NM_014737Hs.80905 Ras association (RalGDS/AF-6) domain fam 


3.7 


f35 


436222 


AI208737 Hs.122810 Homo sapiens cDNA FLJ11489 fis, clone HE 3.7 




430448 


AI633553 Hs.13303 Homo sapiens cDNA: FLJ21 784 fis, clone H 


3.7 


s 7...; 


432729 


AK000292 Hs.278732 hypothetical protein FLJ20285 


3.7 




413916 


N49813 Hs.75615 apolipoprotein C-ll 


3.7 


40 


421662 


NM_014141Hs.106552 cell recognition molecule Caspr2 


3.7 


441633 


AW958544 Hs.1 12242 normal mucosa of esophagus specific 1 


3.7 




408761 


AA057264 Hs.238936 ESTs, Weakly similar to (defline not ava 


3.7 




*tUU 1 <JO 


Target Exon 


3.7 






AW873606 Hs.149006 ESTs 


3.7 


45 




AI88491 1 Hs.32989 receptor (calcitonin) activity modifying 


3.7 


448918 


AB011152 Hs.22572 KIAA0580 protein 


3.7 






AF113676 Hs.297681 serine (or cysteine) proteinase inhibito 


3.6 




448069 


U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 


3.6 




453313 


BE005771 Hs.153746 hypothetical protein FLJ22490 


3.6 


50 


425234 


AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypothetj 


3.6 


41QQ41 

t 1 


X98654 Hs.93837 phosphatidylinositol transfer protein, m 


3.6 






AF188625 Hs.189507 phospholipaseA2, group IID 


3.6 






Z29572 Hs.2556 tumor necrosis factor receptor superfami 


3.6 




448106 


AI800470 Hs.171941 ESTs 


3.6 


55 


426431 


NM 000458HS.169853 transcription factor 2, hepatic; LF-B3; 


3.6 


431843 


AA516420 Hs.183526 ESTs, Weakly similar to 138022 hypothetj 


3.6 




426878 


BE069341 gb:QV3-BT0381-270100-073-c08BT0381 Homo3.6 




434061 


AW024973 Hs.283675 NPD009 protein 


3.6 




445292 


AV653264 Hs.13982 Homo sapiens cDNA FLJ14666 fis, clone NT 3.6 


60 


452101 


T60298 Hs.10844 Homo sapiens cDNA FU14476 fis, clone MA 3.6 


427581 


NM 014788HS.179703 KIAA0129 gene product 


3.6 




409047 


AW961434 Hs.31539 ESTs 


3.6 




416820 


NM 000402HS.80206 glucose-6-phosphate dehydrogenase 


3.6 




410386 


W26187 Hs.3327 Homo sapiens cDNA: FLJ22219 fis, clone H 


3.6 


65 


440516 


S42303 Hs.161 cadherin 2, type 1,N-cadherin (neuronal 


3.6 


434360 


AW015415 Hs.127780 ESTs 


3.6 




428970 


BE276891 Hs.194691 retinoic acid induced 3 


3.6 




415079 


R43179 Hs.22895 hypothetical protein FLJ23548 


3.6 
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409619 AK001015 Hs.55220 BCL2-associated athanogene 2 3.6 

430044 M464510 Hs.152812 ESTs 3.6 

430829 AW451999 Hs.194024 ESTs 3.6 

434224 AA380731 Hs.84 interieukin 2 receptor, gamma (severe co 3.6 

439247 AF088020 Hs.46767 EST 3.6 

431542 H63010 Hs.5740 ESTs 3.5 

430713 AA351647 Hs.2642 eukaryotic translation elongation factor 3.5 

434988 AI418055 Hs.161160 ESTs 3.5 

437748 AF234882 Hs.5814 suppression of tumorigenicity 7 3.5 

418322 AA284166 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK 3.5 

439569 AW602166 Hs.222399 CEGP1 protein 3.5 

459583 AI907673 gb:IL-BT1 52-080399-004 BT1 52 Homo sapien3.5 

403212 NM_019595:Homo sapiens intersectin 2 (IT 3.5 

409099 AK000725 Hs.50579 hypothetical protein FU2071 8 3.5 

453968 AA847843 Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA3.5 

436338 W92147 Hs.1 18394 ESTs 3.5 

422890 Z43784 Hs.75893 ankyrin3,nodeofRanvier(ankyrinG) 3.5 

442295 AI827248 Hs.224398 Homo sapiens cDNA FLJ1 1469 fis, clone HE 3.5 

417975 AA641836 Hs.30085 hypothetical protein FU23 186 3.5 

433730 AK002135 Hs.3542 hypothetical protein FLJ1 1273 3.5 

439926 AW014875 Hs. 137007 ESTs 3.5 

445873 AA250970 Hs.251946 poly(A)-binding protein, cytoplasmic 1-1 3.4 

410153 BE311926 Hs.15830 hypothetical protein FLJ1 2691 3.4 

422128 AW881 145 gb:QV0-OT0033-010400-182-a07 OT0033 Homo 

414921 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.4 

441134 W29092 Hs.7678 cellular retinoic acid-binding protein 1 3.4 

444564 AI167877 Hs.143716 ESTs 3.4 

402470 NA Target Exon 3.4 

418120 AA213437 Hs.192249 ESTs 3.4 

422414 AW875237 Hs.13701 ESTs 3.4 

433345 AI681545 Hs.152982 hypothetical protein FU13117 3.4 

409213 U61412 Hs.51133 PTK6 protein tyrosine kinase 6 3.4 

422611 AA158177 Hs.1 18722 fucosyltransferase8(alpha(1,6)fucosy 3.4 

423554 M90516 Hs.1 674 glutamine-fructose-6-phosphate transamin 3.4 

442619 AA447492 Hs.201 83 ESTs, Weakly similar to AF164793 1 prate 3.4 

402359 NA C19001991*:gi|12656111|gb|AAK00751.1|AF23.4 

439398 AA284267 Hs.221504 ESTs 3.4 

415208 F01020 Hs.172004 titin 3.4 

452853 AA8 12633 Hs.1 0845 ESTs 3.4 

429345 R11141 Hs.1 99695 hypothetical protein 3.4 

449027 AJ271216 Hs.22880 dipeptidylpeptidase III 3.4 

412115 AK001763 Hs.73239 hypothetical protein FLJ10901 3.4 

432180 Y18418 Hs.272822 RuvB (E coli homolog)-like 1 3.4 

428977 AK001404 Hs.194698 cyclin B2 3.4 

431611 U58766 Hs.264428 tissue specific transplantation antigen 3.4 

418286 AA622528 Hs.319825 Homo sapiens, clone IMAGE:361 6574, mRNA,3.4 

436895 AF037335 Hs.5338 carbonic anhydrase XII (tumor antigen H 3.4 

443378 AW392550 Hs.9280 proteasome (prosome, macropain) subunit, 3.4 

428450 NM_01 4791 Hs.1 84339 KIAA0175 gene product 3.3 

449571 AW016812 Hs.200266 ESTs 3.3 

412777 AI335773 Hs.270123 ESTs 3.3 

420542 NM_000505Hs.1321 coagulation factor XII (Hageman factor) 3.3 

412754 AW160375 Hs.74565 amyloid beta (A4) precursor-like protein 3.3 

418327 U70370 Hs.84136 paired-like homeodomain transcription fa 3.3 

449065 AI627393 Hs.258998 ESTs, Weakly similar to high mobility gr 3.3 

425999 AW513051 Hs.332981 ESTs, Weakly similar to I38022 hypotheti 3.3 

430280 AA361258 Hs.237868 interieukin 7 receptor 3.3 

407777 AA161071 Hs.71465 squalene epoxidase 3.3 

426516 BE262660 Hs.170197 glutamic-oxaloacet'c transaminase 2, mit 3.3 

414361 AI086138 Hs.204044 ESTs 3.3 

427080 AW068287 Hs.1 73466 ras-related C3 botulinum toxin substrate 3.3 

426429 X73114 Hs.1 69849 myosin-binding protein C, slow-type 3.3 

446163 AA026880 Hs.25252 prolactin receptor 3.3 

428566 U41763 Hs.1 84916 clathrin, heavy polypeptide-like 1 3.3 

418641 BE243136 Hs.86947 a disintegrin and metalloproteinase doma 3.3 

436293 AI601188 Hs.120910 ESTs * 3.3 



411257 AA628967 Hs.115274 ESTs, Highly similar to IHH_HUMAN INDIAN 3.3 
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430253 AK001514 Hs.236844 hypothetical protein FLJ1 0652 3.3 

430066 AI929659 Hs.237825 signal recognition particle 72kD 3.3 

436469 AK001455 Hs.5198 Down syndrome critical region gene 2 3.3 

437786 BE142681 Hs.155573 polymerase (DNA directed), eta 3.3 

444079 H09048 Hs.23606 ESTs 3.3 

457183 H91882 Hs.1 18569 Dvl-binding protein IDAX (inhibition of 3.3 

431215 AA496078 Hs.121554 Human DNAsequencefromcloneRP11-218C13.3 

424563 AA446932 Hs.151428 ret finger protein 2 3.3 

450828 AW270655 Hs.193804 ESTs 3.3 



408652 R43409 Hs.6829 Homo sapiens mRNA for KIAA1644 protein, 3.3 
445142 AW978484 Hs.93842 Homo sapiens cDNA: FLJ22554 fis, clone H 3.3 
426761 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586l2022 (f3.3 



439237 AW4081 58 Hs.318893 ESTs, Weakly similar to A47582 B-cell gr 3.3 

422616 BE300330 Hs.1 18725 selenophosphate synthetase 2 3.3 

443247 BE614387 Hs.333893 c-Myc target JP01 3.3 

406663 U24683 Hs.302063 immunoglobulin heavy constant mu 3.3 

434137 M907734 Hs.124895 ESTs 3.3 

408877 M479033 Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 3.3 

439101 C01765 Hs.38750 hypothetical protein FLJ 11 526 3.3 

408221 M912183 Hs.47447 ESTs 3.3 

447519 U46258 Hs.339665 ESTs 3.3 

404755 NA Target Exon 3.3 

451871 AI821005 Hs.118599 ESTs 3.2 

420319 AW406289 Hs.96593 hypothetical protein 3.2 



430580 AA806105 Hs.300697 immunoglobulin heavy constant gamma 3 (G 3.2 
400202 NA NM_002795*:Homo sapiens proteasome (pros3.2 



400222 NA NM_002082*:Homo sapiens G protein-couple 3.2 

425988 BE045897 Hs.274454 ESTs, Weakly similar to I38022 hypotheti 3.2 

458098 BE550224 Hs.74170 metallothionein 1E (functional) 3.2 

430589 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polyp 3.2 

431563 AI027643 Hs.120912 ESTs 3.2 

442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7JHUMAN A 3.2 

422309 U79745 Hs.114924 solute carrier family 16 (monocarboxylic 3.2 

419703 AI793257 Hs.128151 ESTs 3.2 

420380 AA640891 Hs.102406 ESTs 3.2 

410853 H04588 Hs.30469 ESTs 3.2 

454417 AI244459 Hs.1 10826 trinucleotide repeat containing 9 3.2 

432745 A1821926 gb:nt78f05.x5 NCI_CGAP_Pr3 Homo sapiens 3.2 

422032 AA476966 Hs.1 10857 polymerase (RNA) III (DNA directed) poly 3.2 

415339 NM_015156Hs.78398 KIAA0071 protein 3.2 

426384 AI472078 Hs.303662 ESTs 3.2 

448030 N30714 Hs.325960 membrane-spanning 4-domains, subfamily A 3.2 

418739 AA310964 Hs.88012 SHP2 interacting transmembrane adaptor 3.2 

442053 R35343 Hs.24968 Human DNA sequence from clone RP1-233G163.2 

434747 AA837085 Hs.220585 ESTs 3.2 

427297 AW292593 Hs.334907 Homo sapiens, clone MGC:1 7333, mRNA, com3.2 

412228 AW503785 Hs.73792 complement component (3d/Epstein Barr vi 3.2 

452304 AA025386 Hs.61311 ESTs, Weakly similar to S10590 cysteine 3.2 

453953 AW408337 Hs.36972 CD7 antigen (p41) 3.2 

407758 D50915 Hs.38365 KIAA0125 gene product 3.2 

451149 AL047586 Hs.1 0283 RNA binding motif protein 8B 3.2 

430015 AW768399 Hs.1 12157 ESTs 3.2 

433313 W20128 Hs.296039 ESTs 3.2 

418334 AA319233 Hs.5521 ESTs 3.2 

450223 AA418204 Hs.241493 natural killer-tumor recognition sequenc 3.2 

454365 AW966728 Hs.54642 methionine adenosyltransferase II, beta 3.2 

451128 AL118668 gb:DKFZp761l0310_rl 761 (synonym: hamy2)3.2 

417793 AW405434 Hs.82575 small nuclear ribonucleoprotein polypept 3.2 

428027 U22029 Hs.334345 cytochrome P450, subfamily I IA (phenobar 3.2 

441197 BE244638 Hs.166 sterol regulatory element binding transc 3.2 

424634 NM_003613Hs.151407 cartilage intermediate layer protein, nu 3.2 

419986 AI345455 Hs.78915 GA-binding protein transcription factor, 3.2 

416714 AF283770 Hs.79630 CD79A antigen (immunoglobulin-associated 3.2 

449465 NM_004380Hs.23598 CREB binding protein (Rubinstein-Taybi s 3.2 

422166 W72424 Hs.112405 S100 calcium-binding protein A9(calgran 3.2 

409079 W87707 Hs.82065 interieukin 6 signal transducer (gp130, 3.2 

423551 AA327598 Hs.233785 ESTs 3.2 
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453553 AA036849 Hs.61829 Homo sapiens cDNA FLJ12763 fis, clone NT 3.2 

442580 AI733682 Hs. 130239 ESTs 3.2 
458079 AI796870 Hs.54277 DNA segment on chromosome X (unique) 9923.2 

425700 AF076292 Hs.159251 forkhead box H1 3.2 

5 417124 BE1 22762 Hs.25338 ESTs 3.2 

407104 S57296 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 3.2 

442215 AI703172 Hs.129005 ESTs, Weakly similar to 21 09260AB cell 3.1 

430271 T06199 Hs.237506 DnaJ (Hsp40) homolog, subfamily B, membe 3.1 

425317 AW205118 Hs.210546 interleukin 21 receptor 3.1 

10 426095 AI278023 Hs.89986 ESTs 3.1 

442313 BE388898 Hs.8215 hypothetical protein FLJ 11 307 3.1 

424709 AL137589 Hs.152149 hypothetical protein DKFZp434K0410 3.1 

429671 BE379335 Hs.211594 proteasome (prosome, macropain) 26S subu 3.1 

432715 AA247152 Hs.200483 ESTs, Weakly similar to KIAA1 074 protein 3.1 

15 431574 AW572659 Hs.261 373 hypothetical protein dJ434014.3 3.1 

436876 AI124756 Hs.5337 isocitrate dehydrogenase 2 (NADP), mitoc 3.1 

I -. 405017 NA Target Exon 3.1 

II 433805 AA706910 Hs.1 12742 ESTs 3.1 
D 437352 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 3.1 
r20 430105 X70297 Hs.2540 cholinergic receptor, nicotinic, alpha p 3.1 
s '~i 422083 NMJ01 141 Hs.1 11 256 arachidonate 15-lirx)xygenase, second typ 3.1 

413507 BE145360 Hs.190064 ESTs, Weakly similar to I38022 hypotheti 3.1 

£0 415989 AI267700 Hs.317584 ESTs 3.1 

r! ! 422907 AI879263 Hs.6986 Human glucose transporter pseudogene 3.1 

J 25 425548 AA890023 Hs.1906 prolactin receptor 3.1 

°H 422599 BE387202 Hs.118638 non-metastatic cells 1, protein (NM23A) 3.1 

p 439963 AW247529 Hs.6793 platelet-activating factor acetylhydrola 3.1 

453883 AI63851 6 Hs.22630 cofactor required for Sp1 transcriptiona 3.1 

L_ 458021 AI885190 Hs.156089 ESTs, Weakly similar to repressor protei 3.1 

H>0 418478 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 3.1 

M: 400814 NA Target Exon 3.1 

==:■ 402327 Target Exon 3.1 

v * 416935 AA190712 gb:zp87f09.r1 Stratagene HeLa cell s3 93 3.1 

439838 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full 3.1 

M5 437036 AI571514 Hs.133022 ESTs 3.1 

T\ 449523 NM_000579Hs.54443 chemokine (C-C motif) receptor 5 3.1 

?y 406642 AJ245210 gb:Homo sapiens mRNA for immunoglobulin 3.1 

406624 AF052762 gb:Homo sapiens clone csneg8-1 immunoglo 3.1 

421924 BE514514 Hs.109606 coronin, actin-binding protein, 1A 3.1 

40 414523 AU076633 Hs.76353 serine (or cysteine) proteinase inhibito 3.1 

416379 N38857 Hs.203933 ESTs 3.1 

422823 D89974 Hs.121102 vanin2 3.1 

433904 AI399956 Hs.208956 ESTs 3.1 

421904 BE143533 Hs.1 09309 hypothetical protein FLJ20035 3.1 

45 428834 AW899713 Hs.339315 ESTs 3.1 

436043 AW963838 Hs.168830 Homo sapiens cDNA FU12136 fis, clone MA 3.1 

452823 AB012124 Hs.30696 transcription factor-like 5 (basic helix 3.1 

405381 NA Target Exon 3.1 

* 428746 AW503820 Hs.1 92861 Spi-B transcription factor (Spi-1/PU.1 r 3.1 
50 435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f3.1 

425782 U66468 Hs.159525 cell growth regulator/ with EF-hand doma 3.1 

423306 W88562 Hs.108198 ESTs 3.1 

419123 AA234276 Hs.88253 ESTs 3.1 

438581 AW977766 Hs.292133 ESTs, Moderately similar to I78885 serin 3.1 

5 5 417105 X60992 Hs.81226 CD6 antigen 3.0 

428361 NM_015905Hs.183858 transcriptional intermediary factor 1 3.0 

417880 BE241595 Hs.82848 selectin L (lymphocyte adhesion molecule 3.0 

402606 NM_024626:Homo sapiens hypothetical prot 3.0 



401451 NM_004496*:Homo sapiens hepatocyte nude 3.0 

60 421878 AA299652 Hs.111496 Homo sapiens CDNAFLJ11643 fis, clone HE 3.0 



409518 BE384836 Hs.3454 KIAA1821 protein 3.0 

416933 BE561850 Hs.80506 small nuclear ribonucleoprotein polypept 3.0 

414324 Y14768 Hs.890 lymphotoxin beta (TNF superfamily, membe 3.0 

425081 X74794 Hs.154443 minichromosome maintenance deficient (S. 3.0 

65 401519 C15000476*:gi|12737279lrefjXP_012163.1| 3.0 

411704 AI499220 Hs.71573 hypothetical protein FLJ10074 3.0 

428819 AL135623 Hs.193914 KIAA0575 gene product 3.0 
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428423 
413835 
412600 
410491 
433658 
427666 
452514 
429500 
432485 
437400 
452234 
413269 
453216 
400929 
445145 
432615 
423279 
429392 
408548 
451346 
413109 
401714 
421462 
421750 
453293 
457085 
438930 



AU076517 

AI272727 

L28824 

AA465131 

L03678 

AI791495 

AI904898 

X78565 

N90866 

AB011542 

AW084176 

BE167526 

AL137566 



Hs.184276 

Hs.249163 

Hs.74101 

Hs.64001 

Hs.156110 

Hs.180142 

Hs.289114 
Hs.276770 
Hs.5599 
Hs.223296 

Hs.32405 



AI961702 Hs.147434 
AA557191 Hs.55028 
AW959861 Hs.290943 
AL109712 Hs.296506 
AA055449 Hs.63187 
NM_006338Hs.26312 
AW389845 Hs.1 10855 
NA 

AF016495 Hs.104624 
AK000768 Hs.107872 
M382267 Hs.10653 
AA412446 Hs.98138 
AW843633 Hs.306163 



solute carrier family 9 (sodium/hydrogen 3.0 
fatty acid hydroxylase 3.0 
spleen tyrosine kinase 3.0 
Homo sapiens clone 25218 mRNA sequence 3.0 
immunoglobulin kappa constant 3.0 
calmodulin-like skin protein 3.0 
gb:RC-BT068-130399-085 BT068 Homo sapien3.0 
hexabrachion (tenascin C, cytotactin) 3.0 
CDW52 antigen (CAMPATH-1 antigen) 3.0 
EGF-like-domain, multiple 5 3.0 
ESTs, Weakly similar to I38022 hypotheti 3.0 
gb:CM4-HT0509-080300-107-g07 HT0509 Homo 
Homo sapiens mRNA; cDNA DKFZp586G0321 (f 
ENSP00000252232*:Sterol regulatory eleme 3.0 
ESTs 3.0 
ESTs, Weakly similar to I54374 gene NF2 3.0 
ESTs 3.0 
Homo sapiens mRNA full length insert cDN 3.0 
ESTs, Weakly similar to ALUC_HUMAN !!!! 3.0 
glioma amplified on chromosome 1 protein 3.0 
ESTs 3.0 
ENSP00000241802*:CDNA FLJ11007 FIS, CLON 
aquaporin 9 3.0 
hypothetical protein FU20761 3.0 
ESTs 3.0 
ESTs 3.0 
hypothetical protein AL1 10115 3.0 



3.0 
3.0 



3.0 
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TABLE 22A 

Table 22 A shows the accession numbers for those pkeys lacking unigenelD's for Table 22. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



407980 103087 1 AA046309 AI263500 AA046397 

410785 1221055J AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 

411743 1256098J AW862214 AW859811 AW862215 

412138 1279172J AW895387 AW895547 AW895564 AW895323 AW895405 AW895539 AW895538 

413269 1356961J BE167526BE167651 BE076401 R24654 

416935 163179J AA190712 AA190665 AA252564 

422128 211994J AW881145 AA490718 M85637 AA304575 T06067 AA331991 

423945 233566J M410943 AW948953 AA334202 AA332882 

424109 235506J AW406878 AW966560 AW966151 AW966496 AA336174 AA335376 AA335537 

424128 235728J AW966163 AA335983 AA33601 1 AA335668 AA335973 

425331 250199J AW962128 AA355353 AA427363 

426878 273265J BE069341 AW748403 AL044891 AI908240 AA393080 

432745 353673J AI821926 AA658826 AA564492 M635129AI791191 

441153 51084_2 BE562826 BE378727 

448212 755099J AI475858 AW969013 

451128 859865J AL1 18668 D78823AI7621 76 

452514 920172J AI904898 AI904849 AI904899 

456207 165078_-1 M1 93450 
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TABLE 22B 



Table 22B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 22. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


400814 


8569925 


Minus 


72840-72924,74761-74849 


400929 


7651921 


Minus 


122033-122241,123483-124028 


401045 


8117619 


Plus 


90044-90184,91111-91345 


401451 


6634068 


Minus 


119926-121272 


401519 


6649315 


Plus 


157315-157950 


401645 


7657839 


Minus 


34986-35133 


401714 


6715702 


Plus 


96484-96681 


401866 


8018106 


Plus 


73126-73623 


402327 


7656695 


Minus 


108675-108770,109801-109910 


402359 


9211204 


Minus 


40403-41961 


402408 


9796239 


Minus 


110326-110491 


402470 


9797107 


Plus 


195129-195776 


402542 


9801558 


Minus 


67076-67594 


402578 


9884928 


Plus 


66350-66496 


402606 


9909429 


Minus 


81747-82094 


403011 


6693597 


Minus 


3468-3623 


403212 


7630897 


Minus 


156037-156210 


403329 


8516120 


Plus 


96450-96598 


403366 


8783692 


Minus 


49323-49652 


403485 


9966528 


Plus 


2888-3001 ,3198-3532,365541 1 7 


404347 


9838195 


Plus 


74493-74829 


404580 


6539738 


Minus 


240588-241589 


404755 


7706327 


Minus 


53729-53846 


405017 


6532084 


Plus 


35551-35690 


405348 


2914717 


Minus 


43310-43462 


405381 


6006920 


Minus 


7636-8054 


405801 


2924321 


Plus 


63469-63694 


405850 


6164995 


Plus 


13871-14110 


406153 


9929734 


Minus 


12902-13069 


406348 


9255985 


Minus 


71754-71944 
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TABLE 23: 320 GENES DOWN-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

Table 23 shows 320 genes down-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 22, except that the numerator was set to the median value 
for 12 non-malignant breast specimens, the denominator was set to the median value amongst 
the 73 breast cancers, the 90 th percentile value amongst the 12 non-malignant breast 
specimens was greater than or equal 80 units, and the ratio was greater than or equal to 4.0 
(i.e. 4-fold down-regulated in tumor vs. normal breast). 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnkjenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of SO 1 " percentile normal body tissue to 75" 1 percentile tumor 



Pkey 


ExAccn 


UnigenelD 


UnigeneTHIe 


Ratio 


428722 


U76456 


Hs.190787 


tissue inhibitor of metalloproteinase 4 


22.4 


428848 


NM 000230 


Hs.194236 


leptin (murine obesity homolog) 


17.4 


445263 


H57646 


Hs.42586 


KIAA1 560 protein 


15.4 


418935 


T28499 


Hs.89485 


carbonic anhydrase IV 


15.0 


407228 


M25079 


Hs.155376 


hemoglobin, beta 


14.6 


417511 


AL049176 


Hs.82223 


chord in-like 


14.6 


439285 


AL133916 


Hs.172572 


hypothetical protein FLJ20093 


14.3 


412442 


AI983730 


Hs.26530 


serum deprivation response (phosphatidyl 


13.6 


410544 


AI446543 


Hs.95511 


ESTs 


12.6 


412047 


AA934589 


Hs.49696 


ESTs 


12.2 


422667 


H25642 


Hs.133471 


ESTs 


12.0 


406664 


L34041 


Hs.9739 


glycerol-3-phosphate dehydrogenase 1 (so 


12.0 


423201 


NM 000163 


Hs.125180 


growth hormone receptor 


11.7 


422163 


AF027208 


Hs. 112360 


prominin (mouse)-like 1 


10.8 


428769 


AW207175 


Hs.106771 


ESTs 


10.6 


407049 


X72632 




NM_021724*:Homo sapiens nuclear receptor 


10.1 


412295 


AW088826 


Hs.1 17176 


poly(A)-binding protein, nuclear 1 


9.8 


425126 


N32759 


Hs.172944 


chorionic gonadotropin, beta polypeptide 


9.8 


406791 


AI220684 


Hs.272572 


hemoglobin, alpha 2 


9.5 


447471 


AF039843 


Hs.18676 


sprouty (Drosophila) homolog 2 


9.5 


451533 


NM 004657 


Hs.26530 


serum deprivation response (phosphatidyl 


9.4 


419407 


AW410377 


Hs.41502 


hypothetical protein FLJ21276 


9.0 


411939 


AI365585 


Hs.146246 


ESTs 


9.0 


410532 


T53088 


Hs.155376 


hemoglobin, beta 


8.9 


425707 


AF1 15402 


Hs.11713 


E74-like factor 5 (ets domain transcript 


8.8 


416585 


X54162 


Hs.79386 


leiomodin 1 (smooth muscle) 


8.7 


443060 


D78874 


Hs.8944 


procollagen C-endopeptidase enhancer 2 


8.6 


435265 


AA779958 


Hs.185932 


ESTs 


8.5 


422511 


AU076442 


Hs.1 17938 


collagen, type XVII, alpha 1 


8.4 


433138 


AB029496 


Hs.59729 


semaphorin sem2 


8.3 


402195 






NM 004497*: Homo sapiens hepatocyte nude 


8.1 


429350 


AI754634 


Hs.131987 


ESTs 


8.1 


445107 


AI208121 


Hs.147313 


ESTs, Weakly similar to I38022 hypothetj 


8.1 


406643 


N77976 


Hs.272572 


hemoglobin, alpha 2 


8.0 


410199 


AW377424 


Hs.205126 


Homo sapiens cDNA: FLJ22667 fis, clone H 


8.0 


417225 


AA815048 


Hs.24078 


hypothetical protein FLJ 12649 


7.8 


437569 


AA760849 


Hs.294052 


ESTs 


7.5 


436062 


AK000027 


Hs.98633 


ESTs 


7.5 


425078 


NM_002599 


Hs.154437 


phosphodiesterase 2A, cGMP-stimulated 


7.5 


430327 


AW973636 


Hs.55931 


ESTs 


7.4 
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447577 


AI393693 


Hs.183297 


DKFZP566F2124 protein 


7.4 




446039 


AI150491 


Hs.90756 


ESTs 


7.2 




422060 


R20893 


Hs.325823 


ESTs, Moderately similar to ALU5_HUMAN A 


7.2 




424455 


AA452006 


Hs.333199 


ESTs 


7.1 


5 


424343 


AW956360 


Hs.4748 


adenylate cyclase activating polypeptide 


7.1 




442792 


AI352340 


Hs.131194 


ESTs 


7.0 




406714 


AI219304 


Hs.283108 


hemoglobin, gamma G 


6.9 




407571 


AI446183 


Hs.9572 


ESTs, Highly similar to CYA5 HUMAN ADENY 


6.8 


10 


429580 


AA346839 


H<5 209100 


DKFZP434C171 protein 


6.7 


453500 


AI478427 


Hs.43125 


esophageal cancer related gene 4 protein 


6.7 




422233 


AB002058 


Hs.1 13275 


purinergic receptor P2X-like 1, orphan r 


6.7 




420205 


AA256395 


Hq RR1 <ifi 
na.oo i ju 


ESTs 


6.6 




404368 


NA 




ENSP00000241075*:TRRAP PROTEIN. 


6.6 


15 


447261 


NM 006691 


Hs 17917 


extracellular link domain-containing 1 


6.5 


417090 AA193282 


H<! 85863 


ESTs, Weakly similar to B34612 zinc fing 


6.5 




410677 NM 003278 


He RW4 


tetranectin (plasminogen-binding protein 


6.5 




427283 AL119796 


He 1741R*i 


ectonucleotidepyrophosphatase/phosphodi 


6.5 




415011 


AW963085 




gb:EST375158 MAGE resequences, MAGH Homo6.4 


412068 


S72043 


He 7T1T* 
nb.r O lOO 


metallothionein 3 (growth inhibitory fac 


6.4 




416253 


BE250659 


no. I «JHDO 


Homo sapiens, clone IMAGE:2959994, mRNA 


6.4 




435885 


AA701483 


riS.0004 1 


ESTs 


6.3 


yj 


402779 


NA 




Target Exon 


6.3 




418138 


AA213626 


ns. I oOiUn 


EST 


6.3 




439335 


AA742697 


He R94Q9 


ESTs, Weakly similar to B39066 proline-r 


6.3 


% 


427019 


AA001732 


He 17^9^* 


hypothetical protein FU 10970 


6.2 


Fu 


411478 


BE143068 




gb:MR0-HT0158-030200-003-b09 HT0158 Homo6.2 


™i 


452654 


BE004783 




gb:MR2-BN0114-270400-004-e11 BN0114 Homo6.1 




447359 


NM_012093 


He 1R9RR 


adenylate kinase 5 


6.1 


L— 1 


414323 


NM 014759 


He "XIAPStfi. 


KIAA0273 gene product 


6.1 




441266 


H15968 


14 e OQ^RilK 

ns.iyoMO 


Homo sapiens, clone IMAGE:3502329, mRNA, 


6.1 




417011 


F08212 


He 9**AftQft 


ESTs, Weakly similar to 2109260A B cell 


6.0 




400089 


NA 




Eos Control 


6.0 




433614 


W07475 


He 977101 


cytochrome c oxidase subunit IV isoform 


5.9 


k i 


440439 


N92818 


He R47M 


ESTs, Weakly similar to potential CDS [H 


5.9 




454404 


BE067414 




gb:MR4-BT0355-200100-201-e05 BT0355 Homo 5.9 


™?\ 


436704 


AA062610 


no. itoujy 


EST 


5.9 


'~; 


406563 


NA 




Target Exon 


5.9 


&5 i 


433490 


AW451023 




hypothetical protein DKFZp7610132 


5.9 


40 


419313 


AA843387 


He A797Q 


ESTs 


5.9 


409196 


NM 001874 


He ^487^ 


carboxypeptidase M 


5.8 




410882 


AW809163 




gb:MR4-ST0118-261099-012-a03 ST0118 Homo5.8 




453469 


AB014533 


He ^vim 


KIAA0633 protein 


5.8 




441899 


AI372588 


He ftfi99 


TU3A protein 


5.8 


45 


426210 


AA372052 


He ^^A^Q 


Homo sapiens cDNA FLJ14458 fis, clone HE 


5.8 


413065 


BE063555 




gb:CM1-BT0283-081199-033-d09 BT0283 Homo5.8 




454192 


AW876813 


He I^A^ 


phosphoglycerate dehydrogenase 


5.7 




425187 


AW014486 


He 99^nQ 


ESTs 


5.7 




429757 AW452355 


He 9^fin?7 


ESTs 


5.7 


50 


420202 


AL036557 


He Qr;Q-in 
ns.yoyiu 


putative lymphocyte G0/G1 switch gene 


5.7 


416284 


AI695473 


Uo 9Qnnn« 
nS.zyoUUo 


ESTs 


5.7 




428553 


AA181641 


He ^RAQH7 

ns. i<54yu/ 


G protein-coupled receptor 1 


5.6 




404689 


NA 




Target Exon 


5.6 




438887 


R68857 


Hs 265499 


ESTs 


5.6 


55 


406082 


S47833 


Hs.82927 


adenosine monophosphate deaminase 2 (iso 


5.6 


449748 


H23963 


Hs.32043 


ESTs 


5.6 




431048 


R50253 


Hs.249129 


cell death-inducing DFFA-like effector a 


5.5 




452205 


C 15819 




gb:C15819 Clontech human aorta polyA mRN 


5.5 




434040 


AW444613 


Hs.288809 


hypothetical protein FLJ20159 


5.5 


60 


407744 


AB020629 


Hs.38095 


ATP-binding cassette, sub-family A (ABC 1 


5.5 


450606 


AI668605 


Hs.60380 


ESTs, Moderately similar to ALU6_HUMAN A 


5.5 




414629 


AA345824 


Hs.76688 


carboxylesterase 1 (monocyte/macrophage 


5.5 




401665 






C11000703:gi|10048448|ref|NP_065258.1 1 g 


5.5 




436107 


T99079 


Hs.191194 


ESTs 


5.5 


65 


444432 


AI161428 


Hs.75916 


splicing factor 3b, subunit 2, 145kD 


5.5 


434715 


BE005346 


Hs.1 16410 


ESTs 


5.5 




447205 


BE617015 


Hs.11006 


ESTs, Moderately similar to T17372 plasm 


5.5 




408122 


AI432652 


Hs.42824 


hypothetical protein FLJ10718 


5.5 



336 



454016 


AW016806 


Hs.233108 


ESTs 


5.5 


414913 


R25621 




gb:yh45(06.r1 Soares placenta Nb2HP Honno 


5.4 


459033 


AA017590 


Hs. 129907 


ESTs 


5.4 


441003 


8E1 72240 


Hs. 126379 


ESTs, Weakly similar to I38022 hypotheti 


5.4 


450637 


N49826 


Hs. 18602 


ESTs 


5.4 


442398 


AA994520 




gb:ou42g09.s1 Soares_NFL_T_GBC_S1 Homo 


s5.4 


403612 


NA 




Target Exon 


5.3 


407102 


AA007629 


Hs.9739 


glycerol-3-phosphate dehydrogenase 1 (so 


5.3 


410057 


R66634 


Hs.268107 


multimerin 


5.3 


428232 


BE272452 


Hs.183109 


monoamine oxidase A 


5.3 


432769 


AA620814 


Hs. 144959 


ESTs 


5.3 


431344 


R99530 


Hs.272572 


hemoglobin, alpha 2 


5.3 


427032 


AF012023 


Hs. 173274 


integrin cytoplasmic domain-associated p 


5.3 


406305 


BE261320 


Hs. 1581 96 


transcriptional adaptor 3 (ADA3, yeast h 


5.3 


437411 


AW613948 


Hs.194915 


ESTs 


5.3 


442800 


AI809481 


Hs.131227 


ESTs 


5.3 


402054 


NA 




Target Exon 


5.3 


432085 


AF212829 


Hs.272406 


potassium channel, subfamily K, member 9 


5.3 


415313 


R59638 


Hs.6181 


ESTs 


5.2 


459159 


AI904646 




gb:QV-BT065-020399-103 BT065 Homo sapien 


5.2 


427164 


AB037721 


Hs. 173871 


KIM1300 protein 


5.2 


441391 


BE467930 


Hs. 170381 


ESTs 


5.2 


458959 


AI285901 


Hs.181297 


ESTs 


5.2 


402698 


NA 




ENSP00000251335*:DJ1003J2.1 (sodium and 


5.2 


401810 


NA 




Target Exon 


5.2 


438879 


AA827674 


Hs.189073 


ESTs 


5.2 


414657 


M424074 


Hs.76780 


protein phosphatase 1, regulatory (inhib 


5.2 


427809 


M26380 


Hs.180878 


lipoprotein lipase 


5.1 


456063 


NM_006744 


Hs.76461 


retinol-binding protein 4, interstitial 


5.1 


451186 


AW023469 


Hs.65256 


ESTs, Weakly similar to leucine-rich gli 


5.1 


451882 


A1821324 


Hs 100445 


ESTs 


5.1 


402583 


NA 




NM_021620:Homo sapiens PR domain contain 


5.1 


431130 


NM_006103 


Hs.2719 


HE4; WFDC2; putative ovarian carcinoma m 


5.1 


458218 


AI435179 


Hs. 126820 


ESTs 


5.1 


416083 


R53467 


Hs.269122 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


5.1 


455282 


BE1 43867 




gb:MR0-HT0164-070100-013-h02 HT0164 Homo5.1 


426488 


X03350 


Hs.4 


alcohol dehydrogenase 1B (class I), beta 


5.1 


426156 


BE244537 


Hs. 167382 


natriuretic peptide receptor A/guanylate 


5.1 


407891 


AA486620 


Hs.41135 


endomucin-2 


5.0 


408610 


AW026692 


Hs.224829 


ESTs 


5.0 


445967 


D59597 


Hs.1 18821 


CGI-62 protein 


5.0 


434813 


AI524307 


Hs. 162870 


ESTs 


5.0 


437526 


AI076012 


Hs. 121 388 


ESTs, Weakly similar to MDHC HUMAN MALAT 5.0 


454775 


BE1 60229 




gb:QV1-HT0413-090200-062-a12 HT0413 Homo5.0 


409451 


AF012626 


Hs.54472 


fragile X mental retardation 2 


5.0 


409853 


AW502327 




gb:UI-HF-BROp-aka-a-07-0-Ul.r1NIH_MGC_5 


5.0 


405062 






Target Exon 


5.0 


446490 


AK000706 


Hs.15125 


hypothetical protein FLJ20699 


5.0 


417622 


AW298163 


Hs.82318 


WAS protein family, member 3 


5.0 


421978 


AJ243662 


Hs.1 10196 


NICE-1 protein 


5.0 


440338 


R62431 


Hs. 12758 


ESTs 


5.0 


415421 


R35009 


Hs.24903 


ESTs 


5.0 


417574 


R00348 




gb:ye69e06.r1 Soares fetal liver spleen 


5.0 


409882 


AJ243191 


Hs.56874 


heat shock 27kD protein family, member 7 


5.0 


447998 


AI768289 


Hs.304389 


ESTs 


4.9 


445613 


BE550889 


Hs.158491 


ESTs 


4.9 


443074 


AW341470 


Hs.144907 


ESTs 


4.9 


451324 


AI783600 


Hs.208052 


ESTs 


4.9 


432433 


AW014734 


Hs.1 57969 


ESTs 


4.9 


449654 


AI989812 


Hs.199850 


ESTs 


4.9 


414519 


N94587 


Hs.55063 


ESTs 


4.9 


457531 


AW973716 


Hs.13913 


KIAA1577 protein 


4.9 


433200 


AA682722 


Hs.192725 


ESTs 


4.9 


430782 


AF026263 


Hs.247920 


cholinergic receptor, muscarinic 5 


4.8 


427555 


AW137094 


Hs.97990 


ESTs 


4.8 


433545 


AA868510 


Hs.1 12496 


ESTs 


4.8 


420334 


AI349351 


Hs.1 18944 


hypothetical protein FLJ22477 


4.8 



337 



421795 


X63094 


He 98187? 


427138 


N77624 


He 171717 
no. if jf if 


413072 


BE063965 




443721 


AW450451 


He PRRI'i-i 


408053 


AW139474 


He 94fififi9 


427067 


M843716 


He 177Q97 
no. I / / dli 


442969 


AI025499 


He 11591ft 
no. ioz^oo 


426220 


AI383475 


He 171KQ7 
no. 1 / iuo/ 


414593 


BE386764 




426893 


AA398716 


He Q7A1ft 


434046 


AW292618 


He 111fl11 
no. i iou i i 


401590 


NA 




457971 


AW134679 


He 949JUQ 


427722 


AK000123 


He 1fifUI7Q 
no. lOU't/y 


443793 AA045290 


He osom 


Qui lit 


D/IQ-1Q7 

K4yw 


He RRRQ 
no.oooy 


AA1 f*iCG 

441 yoo 


A AQT0107 

AAy/ loci 


He 1A9Q(11 
no. I*r£oUO 


AA1AQQ 




He miRRQ 
no. 1U IDOy 


AA7E47 
44/01/ 


At3o272o 


He 1R941il 
no. IOZ*tO*t 


4U3U1 / 






450580 


N40087 


He 1594ft 
no. I0Z40 


AftAGA A 

4U4b1 1 


H58589 


ns.oo i oo 


414831 


M31158 


Lj c 77/ on 

no. / 


459290 


NM_001546 


We ^/ft^O 
nS.o400.3 


444341 


AI142027 


Uc *MRK£n 
HS.1400DU 


408614 


AL1 37698 


nb.4DOo I 


449638 


AW204277 




434418 


AF1 34707 


Lie *57*"tfi7Q 


447360 


AI375984 


Uc 1fi701fi 

HS.IO/zIO 


419583 


F00312 




440698 


AI348455 


ns. t4/4yz 


451199 


AI290653 


We 1947RH 

HS.1Z4/0O 


438338 


NM_014861 


We £1£ft 

ns.o loo 


433756 


AW015933 


We 1 •fOCRjl 

ns.11zb04 


423301 


S67580 


We icyic 
nS.1040 


417237 


H86385 


We 1117*27 

ns.ol/o/ 


439745 


AL389981 


We 140910 

ns.i4y^iy 


424137 


AA335769 


Up 1COCO 

ns.lb^oz 


449338 


H73444 


HS.jy4 


434744 


N94835 


ns.zooozo 


407402 


AF035303 




443510 


NM 012190 


We OJCO/1 


415754 


M169114 


We 1*)Ov(7 

HS.1ZZ4/ 


415986 


Z43619 




457416 


BE1 42052 


We SICCA 

MS.0^004 


418064 


BE387287 


We OOOQ,) 


437120 


AI356125 


We 4C77C7 

ns. 157767 


453950 


AA156998 


We II ACCO 

ns.zllooo 


401093 






436935 


AW206494 


We lEocen 
HS.^OJOfaO 


457974 


AW842353 


We *JO-l"7'17 

HS.021/17 


428222 


AL133112 


We loinoc 

ns.looOoo 


442705 


AI264634 


We 41H *V7 

nS.loil// 


437409 


AL359599 


Hs.283850 


458494 


AI380906 


Hs.158436 


410490 


H03589 




416069 


R37101 


Hs.20982 


438463 


AA807958 


Hs.3 14232 


444043 


AI499723 


Hs.135089 


438327 


H87407 


Hs. 172944 


457711 


AF1 47401 


Hs.23917 


400870 






441425 


AA933590 


Hs.28937 


416267 


H45384 




407262 


M12873 




444567 AV654020 


Hs.184261 


403263. NA 





Rhesus blood group, D antigen 4.8 

phosphatide acid phosphatase type 2B 4.8 
gb:QV3-BT0296-140200-085-h01 BT0296 Homo 4.8 

ESTs 4.8 

ESTs 4.8 

ESTs 4.7 

ESTs 4.7 

ESTs, Weakly similar to T1 3924 sdk prate 4.7 
gb:601273249F1 NIH_MGC_20 Homo sapiens c 4.7 

ESTs 4.7 

ESTs 4.7 

Target Exon 4.7 

ESTs 4.7 

hypothetical protein FLJ20116 4.6 

ESTs, Weakly similar to 21 09260A B cell 4.6 

ESTs 4.6 

ESTs 4.6 

ESTs 4.6 

ESTs 4.6 

Target Exon 4.6 

ESTs 4.6 

Homo sapiens cDNA FU1 1027 fis, clone PL 4.6 

protein kinase, cAMP-dependent, regulato 4.6 

inhibitor of DNA binding 4, dominant neg 4.6 

ESTs ' 4.6 

Homo sapiens mRNA; cDNA DKFZp434C1915 (f 4.6 

hypothetical protein MGC2747 4.6 

a disintegrin and metalloproteinase doma 4.6 

ESTs 4.6 

gb:HSBB0D101 STRATAGENE Human skeletal m4.6 

Homo sapiens cDNA FLJ1 1777 fis, clone HE 4.6 

ESTs 4.6 

KIAA0703 gene product 4.6 

Homo sapiens, clone MGC:9764, mRNA, comp 4.5 

cytochrome P450, subfamily IVA, polypept 4.5 

palmitoyl-protein thioesterase 2 4.5 

Homo sapiens mRNA full length insert cDN 4.5 

ESTs 4.5 

adrenomedullin 4.5 

Homo sapiens genomic DNA, chromosome 21q 4.5 

gb:Homo sapiens clone 23943 mRNA sequenc 4.5 

formyltetrahydrofolate dehydrogenase 4.5 

hypothetical protein FLJ1 1413 4.5 

gb:HSC1GE121 normalized infant brain cDN 4.5 

kringle-containing transmembrane protein 4.5 

S100 calcium-binding protein, beta (neur 4.4 
ESTs, Weakly similar to HXA2JHUMAN HOME04.4 

eukaryotic translation initiation factor 4.4 

C12000586*:gi|6330167|db]|BAA86477.1 1 (A 4.4 

ESTs 4.4 

ESTs, Weakly similar to S22765 heterogen 4.4 
Homo sapiens mRNA; cDNA DKFZp434K098 (fr 4.4 

ESTs 4.4 
Homo sapiens mRNA; cDNA DKFZp547C126 (fr 4.4 

ESTs 4.4 

gb:yj42d08.r1 Soares placenta Nb2HP Homo 4.4 

ESTs 4.4 

ESTs 4.4 

ESTs 4.4 

chorionic gonadotropin, beta polypeptide 4.4 

ESTs 4.3 

C1 1000905:gi| 1 1 692565|gb|AAG39879.1 |AF28 4.3 

homeobox protein from AL590526 4.3 

gb:yn99c10.r1 Soares adult brain N2b5HB5 4.3 

gb:Human Ig rearranged H-chain mRNA VDJ4 4.3 

ESTs, Weakly similar to T26686 hypotheti 4.3 

Target Exon 4.3 



338 



410034 


BE067414 




gb:MR4-BT0355-200100-201-e05 BT0355 Homo4.3 


456804 


AI421645 


Hs.139851 


rPVPOlin 2 


4.3 


448427 


BE395260 


Hs.309438 


EST 


4.3 


416931 


D45371 


Hs.80485 


adipose most abundant gene transcript 1 


4.3 


421296 


NM.002666 


Hs.103253 


perilipin 


4.3 


400973 


NA 




ENSP00000236667*:Mucin 5B (Fragment). 


4.3 


452602 


AW366194 


Hs.55962 


ESTs 


4.3 


412330 


NM_005100 


Hs.788 


A kinase (PRKA) anchor protein (gravin) 


4.3 


405016 






CY000171 *:gi|9280405|gb|AAF86402.1 |AF245 


4.3 


435104 


AI475671 


Hs.88607 


FSTs Hinhlv similar tn F-hny nrnfpin FR 

L.JIO, 1 IIUI IIV omnia! \SJ 1 U\JA L/IUldll 1 u 


4.3 


406118 






ENSP00000246632:CDNA FLJ20261 fis, clone 


4.3 


418556 


T02850 




ab*FB12A9 Fptal brain Stratanpnp Hnmo «. 


4.3 


429745 


AA480818 


Hs.221736 


ESTs 


4.3 


433088 


AW451206 


Hs.1 15899 


ESTs 


4 


444445 


AA342329 


Hs.1 15920 


Hnrnn eanipne rDWA* Fl l99R1Rfie rlnnp K 


*i.J 


453880 


AI803166 


Hs.28462 


ESTs Weaklv similar to I38029 hvnothpti 


4 3 


447384 


AI377221 


Hs.40528 


CO I o 


A 9 

H.C. 


414541 


BE293116 


Hs 763Q2 


sldehyde dehydroQsnase 1 family, member 




444975 




He 1R94R9 
no. IO£HO£ 


F9Tc Wpflklv eimilar tr\ Tftft^R9 hunnthpfi 
co id, v v cd m y oHiiifai lu i wOvt itypuuicLt 


A 9 

•tit 


403921 


NA 




r ( innfl919*-niMnn479^7lHhilRAR1' : l4n7 11 fA 


A 9 


451477 




Hs.42710 


to 1 S 


A O 








Pftrtfi-IWWniM 1R^7lrtK;iRAR*1ftQQf; 11 /API 
L-DUU lOOU.yi| 1 10 1 IOo/|Qp]jDAO I09o0. I| (Ab 






AA1Q19M 


He T^ftfil 
no. o joq i 


urvrirooot lo/i i protein 


A 0 


41 *iRRO 


DC 133000 


He 95R99 
no. ZooZZ 


klAAIRDR nrntain 


A 0 






He 1447Q4 


FQTe 
CO I 5 


A 0 
'i.e. 


4ndfift9 


NIA 




PQfin*l'1flR* , niM97^flfi49lrafiMD {\71'7 f )~\ ^^ n 
L»yUU 1 100 .yi[ It/ ooo^ZjrcTjiNr^j 0/ to. 1 1 p 


A 0 


*t IOUOC7 




no.oooo 


uoio, vvcdKiy biiTiiidr io \i oooo senne/ui 


A 9 








iNivi_uu lozz.norno oaptens aipna-t-no-yiycop 


A 0 




AWQ754R0 


no. (40000 


Co I o 


A 9 


414217 




He 97QRQR 


Unmn eanionc /»HMA' Fl lO^IR^ fie /'•Inn a 1 

nomo Sapiens cuina. rLjzo 100 us, cione u 


A 0 


41R4?<% 

•t 1 OHivJ 


A 187 1947 


He R9R9 


hunnthQ+ica! nmtoin fj!fiPRAn7 

nypuuiciicdi proxcin wouo'hj/ 


A 9 


41Q"5fiQ 




He oni 09*; 


Homo sapiens cDNA FLJ13446 fis, clone PL 


4.Z 


4*7090. 

*tO/ Uto 


A AOQ77QQ 


He 1fi1RD^ 
no. 10 I0U0 


CO IS 


A O 




Mr I300U/ 


He 1^1 R^5 


Rhesus blood group, B glycoprotein 


A 0 


44RQRR 


YftQ7fi^ 

I Uj / UO 


He 997R* 
nax£/ 00 


gam ma-am inobutync acid (GABA) A recepto 


A 9 


*r*rUU 1 U 


nl/ O JU30 


no. I OUOw 


ESTs 


4 2 


439590 


AF086410 




gu.nuuio bdpiciib lull jengin msen cuinm 


A 9 


49794(1 




He 974151 
nox/'t io I 


linstin 


4.2 


408932 


AW594172 


Hs.278513 




4 9 


436112 


T77545 


1 IO. lOr %J*J& 


COTc 

LU 1 O 


4 9 


444382 


AI144152 


Hs 5A746 


CO 1 o 


4 9 
*t.t 


456716 


A A"? 180(50 


Hs 135121 

no, i it. i 


hvnnthptiral nrntAin Fl 19941^ 

uy^A/u ICU^di piuicni rum l\/ 


4 9 


419846 


NM 015977 


Hs.285681 


lA/illlomc Dai Iran o/n^rrtrtia ^hrrtmftertinQ 

vviiiiaino-Dcurcn synuroiuc cnromosoine reyi 


A 9 






He 57Q7 1 ! 
no. or 3/ o 


calsequestrin 2 (cardiac muscle) 


4. 1 


*tlA^O*rO 


NA 




Target Exon 


4. 1 


HUOVJO 1 


NA 




Target Exon 


4.1 


ttU 1 OJ7 


nm (ww 


no.yoOo I 


lipase, hormone-sensitive 


4.1 


*tOvt*Ht 


AAnn7<;i4 


He 195fifi9 
no. IdDVOd 


ESTs 


4.1 




AAflldUfi 


He 11R4Q4 

ns. i 10434 


ESTs 


4.1 




VVOjlU 1 u 


He 1Q1T7Q 
ns. i y i o/ y 


ESTs 


4.1 


»* 1 *rO 1 D 




He n*wfin 

no. IOooOU 


ESTs, Weakly similar to T43458 hypotheti 


4.1 






He 1 9999R 
no. i^llu 


ESTs 


4.1 


4109^ 

H 1 \j£.dd 


AA(1fl9Q47 




gb:zn10g07.s1 Stratagene hNT neuron (937 


4.1 


412179 


BE270758 


Hs.69428 


hypothetical protein MGC3020 


4.1 


441871 


AI306150 


Hs.153450 


ESTs, Weakly similar to 1909123ANa glue 


4.1 


426411 


AK000708 


Hs.169764 


hypothetical protein FLJ20701 


4.1 


453692 


AL1 10416 




gb:DKFZp434K0431_rt 434 (synonym: htes3) 


4.1 


448640 


AW817177 


Hs.102558 


Homo sapiens, clone MGC:5352, mRNA, comp 


4.1 


417481 


AA203281 


Hs.21798 


ESTs 


4.1 


412912 


AW1 18878 


Hs.1 10835 


ESTs 


4.1 


454183 


AW807116 




gb:MR4-ST0062-040100-024-b12 ST0062 Homo4.1 


426328 


AW631296 




gb:hh83c09.y1 NCI_CGAP_GU1 Homo sapiens 4.1 


435942 


R06285 


Hs.191215 


ESTs 


4.1 


417629 


T76945 




gb:yc92c07.r1 Soares infant brain 1NIB H 


4.1 


403593 


NA 




Target Exon 


4.0 


402690 






Target Exon 


4.0 


418190 


R49591 


Hs.270425 


ESTs 


4.0 
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408641 


AW245207 


Hs.5555 


hypothetical protein MGC5347 


4.0 


427899 


AA829286 


Hs.332053 


serum amyloid A1 


4.0 


445975 


AI811536 


Hs. 145734 


ESTs 


4.0 


438831 


BE263273 


Hs.6439 


synapsin II 


4.0 


455578 


BE006350 


Hs.14355 


Homo sapiens cDNA FLJ13207 fis, clone NT 


4.0 


401840 


NA 




Target Exon 


4.0 


413753 


U17760 


Hs.75517 


laminin, beta 3 (nicein (125kD), kalinin 


4.0 


445030 


AI205925 


Hs.147238 


ESTs, Highly similar to AAC3_HUMAN ALPHA 


4.0 


433873 


AW156913 


Hs.1 50478 


ESTs, Weakly similar to A Chain A, Cryst 


4.0 


456736 


AW248217 


Hs.1619 


achaete-scute complex (Drosophila) homol 


4.0 


450112 


BE047734 


Hs.5473 


ESTs, Moderately similar to ALU5 HUMAN A 


4.0 


448906 


AI589567 


Hs.309719 


ESTs 


4.0 
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TABLE 23A 

Table 23 A shows the accession numbers for those pkeys lacking unigenelD's for Table 23. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



409853 1 156226J AW502327 AW502488 AW501829 AW502625 AW502687 

410034 1170594J BE067414 BE067958 BE067419 BE067963 AW577127 AW601412 

410233 118656J AA082947 M083036 

410490 1205347J H03589 AW750687 AW750688 

410882 1225686_1 AW809163 AW809247 AW809177 AW809190 AW809225 

41 1478 1 247073J BE143068 AW849143 AW848705 AW848569 AW848071 AW848475 AW848092 AW848005 
413065 1347960J BE063555 BE151321 BE151319 BE151657 BE151655 BE063556 BE151322 

413072 1348163J BE063965 BE063968 BE064034 BE064028 BE063874 BE063966 BE063869 BE064043 BE064033 BE063884 

414593 1464909J BE386764 BE387560 

414913 1506721J R25621 C03959 C04010 

415011 151328J AW963085AA1 59005 AW963073 

415986 1564410J Z43619R61274H12206R12883 

416267 1583547J H45384 H49125 H41699 

417574 1687770_1 R00348 R09593 

417629 1690392J T76945 R20210 R05755 

418556 1767866_-1 T02850 

419583 186198J F00312 AA247490 F31427 AA383663 F22045 

426328 264901J AW631296 AA375484 

439590 47413.1 AF086410 W94386 W74609 

442398 541271.1 AA994520 AW393574 

452205 90415J C15819 AA024741 AA024742 

452654 925931.1 BE004783 BE004947 AI911790 

453692 977825.1 AL1 10416 AW876759 

454183 1049636.1 AW8071 16 AW807569 AW807415 AW807338 AW807288 AW807263 AW807316 AW177402 AW807413 AW807068 BE141561 
BE141569 AW807401 AW807310 BE141565 AW807318 AW8071 19 AW807299 AW807241 AW807225 AW807204 AW807345 
AW807103 BE141615 AW807431 AW807393 AW807337 AW807406 AW807259 AW807375 AW845890 AW807220 AW807399 
AW807064 AW807376 AW807024 BE141595 AW807236 AW807027 AW8071 12 AW807378 AW807202 BE141593 AW807216 
AW8071 38 AW807244 AW807221 AW807297 AW807050 AW807248 AW807404 AW807075 AW807237 AW807212 AW807308 
AW807110 AW807104 BE140912 AW807301 AW807382 AW807294 AW807026 AW807020 AW807108 AW807025 AW807433 
AW807124 AW807419 AW807031 AW807264 AW807032 AW807029 AW807052 AW807391 AW807207 AW807215 AW807019 
AW807238 AW807201 BE141590 AW807302 AW807323 AW807380 AW807109 BE141588 AW845877 AW807418 AW807407 
AW807309 BE141614 AW845861 AW807396 AW807300 AW807348 AW807311 AW807214 AW807132 AW807402 AW807350 
AW807028 AW807298 AW807291 AW807305 AW807217 AW807312 AW807261 AW807268 AW807260 AW807266 AW807198 
AW807346 AW807315 BE140940 AW807351 AW807122 AW807235 AW807076 AW807314 AW807213 AW807143 AW807131 
AW800079AW807066AW807077 AW845923 AW807421 BE141619 BE140943 AW807420 AW807381 BE141621 AW845921 
BE141629 BE141625 BE141624 BE141636 BE141630 AW807405 AW807290 AW807353 

454404 1170594.1 BE067414 BE067958 BE067419 BE067963 AW577127 AW601412 

454775 1234106.1 BE160229AW819879AW820179AW819882AW819876AW820169BE153201 AW993736 BE152911 
455282 1273020.1 BE143867 AW935060 AW886684 
459159 919998.1 AI904646 BE179494 BE179421 
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TABLE 23B 



Table 23B shows the genomic positioning for those pkeys lacking unigene ID's and 
5 accession numbers in Table 23. For each predicted exon, we have listed the genomic 

sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

0 Ret Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

Nt_position: Indicates nucleotide positions of predicted exons. 



Ml5 





Pkey 


Ref 


Strand 


Ntjiosition 


4TSS5. 


400545 


9800107 


Minus 


124618-124881 




400870 


9838306 


Minus 


34081-35027 




400973 


7960452 


Minus 


98119-98253 




401093 


8516137 


Minus 


22335-23166 


i ?; ; 


401590 


9966320 


Minus 


33547-33649 


V I 


401665 


7145001 


Plus 


121591-122537 


W25 


401810 


7342191 


Plus 


129063-129476 


401840 


7684597 


Plus 


56283-56439 




402054 


8083691 


Minus 


8288-8806 


T~" : 


402195 


7689778 


Minus 


147901-148884 




402583 


7684486 


Plus 


94883-95003 




402690 


8348058 


Plus 


13368-13998 


ri3o 


402698 


8570304 


Minus 


108641-108903 




402779 


9588555 


Minus 


38173-39210 




403017 


6693623 


Plus 


78630-79367 


P 


403051 


4827080 


Minus 


5269-5411 




403263 


7770677 


Plus 


52431-52737 


= *35 


403433 


9719611 


Minus 


72225-72437 




403593 


6862650 


Minus 


62554-62712,69449-69602 




403612 


8469060 


Minus 


94723-94859 




403921 


7711590 


Minus 


3297-3536 


40 


404368 


7630956 


Minus 


102053-102199 


404682 


9797231 


Minus 


40977-41150 




404689 


7534100 


Plus 


119461-119717 




405016 


6524300 


Plus 


51997-53308 




405062 


7657730 


Plus 


101283-101432 


45 


406118 


9143818 


Plus 


53997-54629 


406344 


9255974 


Plus 


20254-20374,20526-20659,20835-21097 




406563 


7711604 


Plus 


34401-34538 
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TABLE 24: 

Table 24 depicts Seq ID No., UnigenelD, UnigeneTitle, Pkey, Pred.Cell.Loc, and ExAccn 
for all of the sequences in Table 25. The information in Table 24 is linked by Seq ID No to 
Table 25. 



Pkey: 

ExAccn: 

UnigenelD: 

Unigene Title: 

Pred.Cell.Loc: 

Seq.ID.No.: 



Unique Eos probeset identifier number 
Exemplar Accession number, Genbank accession number 
Unigene number 



Predicted Cellular Localization 

Sequence Identification Number found in Table 25 



Pkey ExAccn UnigenelD Unigene Title 



Pred.Cell.Loc. Seq. ID. No. 



449746 AI668594 Hs.1 76588 
407276 AI951118 Hs.326736 
415539 AI733881 Hs.72472 
400297 AM 27076 Hs.334473 
450375 AA009647 Hs.8850 
102457 NM_001394Hs.2359 
429170 NM_001394Hs.2359 
424399 AI905687 Hs.2533 
422505 AL120862 Hs.124165 
449765 N92293 Hs.206832 
425692 D90041 Hs. 155956 
426215 AW963419 Hs.155223 
439840 AW449211 Hs. 105445 
410102 AW248508 Hs.279727 
429220 AW207206 Hs.136319 
416276 U41060 Hs.79136 
409079 W87707 Hs.82065 
442818 AK001741 Hs.8739 
442082 R41823 Hs.7413 
444381 BE387335 Hs.283713 
446163 M026880 Hs.25252 
416636 N32536 Hs.42645 
442117 AW664964 Hs.128899 
433043 W57554 Hs.125019 
429353 AL1 17406 Hs.200102 
452190 H26735 Hs.91668 
446733 AA863360 Hs.26040 
452747 BE153855 Hs.61460 
423242 AL039402 Hs.125783 
417433 BE270266 Hs.82128 
432201 AI538613 Hs.298241 
423961 D13666 Hs.136348 
439569 AW602166 Hs.222399 
114480 BE066778 Hs.151678 
404561 
325372 NA 

112287 AB033064 Hs.334806 
335824 NA 

424735 U31875 Hs.272499 
400289 X07820 Hs.2258 
427585 D31152 Hs.179729 
429925 NM_000786Hs.226213 
429441 AJ224172 Hs.204096 
421155 H87879 Hs.102267 
420931 AF044197 Hs.100431 
420813 X51501 Hs.99949 
452744 AI267652 Hs.30504 



ESTs, Weakly similar to CP4Y_HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-BR 
BMP-R1B 

hypothetical protein DKFZp56401278 

a disintegrin and metalloproteinase doma 

dual specificity phosphatase 4 nuclear 

dual specificity phosphatase 4 nuclear 

aldehyde dehydrogenase 9 family, member cytoplasm 

ESTs 

ESTs, Moderately similar to ALU8JHUMAN A 

N-acetyltransferase 1 (arylamine N-acety 

stanniocalcin 2 

GDNF family receptor alpha 1 

Homo sapiens cDNA FLJ14035 fis, clone HE 

ESTs 

LIV-1 protein, estrogen regulated 
interleukin 6 signal transducer (gp130, 
hypothetical protein FLJ10879 
ESTs 

ESTs, Weakly similar to S64054 hypotheti 
Homo sapiens cDNA FLJ13603 fis, clone PL 
solute carrier family 16 (monocarboxylic 
ESTs 

lymphoid nuclear protein (LAF-4) mRNA 
ATP-binding cassette transporter MRP8 
Homo sapiens clone PP1498 unknown mRNA 
ESTs, Weakly similar to fatty acid omega 
lg superfamily receptor LNIR 
DEME-6 protein 

5T4 oncofetal trophoblast glycoprotein 
Transmembrane protease, serine 3 
osteoblast specific factor 2 (fasciclin 
CEGP1 protein 

UDP-N-acetyl-alpha-D-galactosamine:polyp 
NM_0141 12*:Homo sapiens trichorhinophala mitochodria 
Phase 2 & 3 Exons nuclear 
KIAA1238 protein 

ENSP00000249072*:DJ222E13.1 (N-TERMINAL 
short-chain alcohol dehydrogenase family 
matrix metalloproteinase 10 (stromelysin 
collagen, type X, alpha 1 (Schmid metaph 
cytochrome P450, 51 (lanosterol 14-alpha ER 
lipophilin B (uteroglobin family member) 
lysyl oxidase extracellular 
small inducible cytokine B subfamily (Cy 
prolactin-induced protein nuclear 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 
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Seq ID 1 & 2 
SeqlD3&4 
Seq ID5&6 
Seq ID 7 & 8 
Seq ID 9 & 10 
Seq ID 11 & 12 
Seq ID 11 & 12 
Seq ID 13 & 14 
Seq ID 15 & 16 
Seq ID 17 & 18 
Seq ID 19&20 
Seq ID 21 & 22 
Seq ID 23 & 24 
Seq ID 25 & 26 
Seq ID 27 & 28 
Seq ID 29 & 30 
Seq ID 31 &32 
Seq ID 33&34 
Seq ID 35 & 36 
Seq ID 37 & 38 
Seq ID 39 & 40 
Seq ID 41 &42 
Seq ID 43 & 44 
Seq ID 45 & 46 
Seq ID 47 & 48 
Seq ID 49 & 50 
Seq ID 51 & 52 
Seq ID 53 & 54 
Seq ID 55 & 56 
Seq ID 57 & 58 
Seq ID 59 & 60 
Seq ID 61 & 62 
SeqlD63&64 
Seq ID 65 & 66 
Seq ID 67 & 68 
Seq ID 69 & 70 
Seq ID 71 &72 
Seq ID 73 & 74 
Seq ID 75 & 76 
Seq ID 77 & 78 
Seq ID 79 & 80 
Seq ID 81 &82 
Seq ID 83 & 84 
Seq ID 85&86 
Seq ID 87 & 88 
Seq ID 89 & 90 
Seq ID 91 &92 
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420757 X78592 Hs.99915 
424905 NM_002497Hs.153704 
429859 NM_007050Hs.225952 
446921 AB012113 Hs.16530 
445537 AJ245671 Hs.12844 
428227 M321649 Hs.2248 
424001 W67883 Hs.137476 
421727 Y13153 Hs.107318 
452838 U65011 Hs.30743 
419667 AU077005 Hs.92208 
414812 X72755 Hs.77367 
426320 W47595 Hs.169300 
422867 L32137 Hs.1584 
411558 AA102670 Hs.70725 
417866 AW067903 Hs.82772 
428398 AI249368 Hs.98558 
431958 X63629 Hs.2877 
428722 U76456 Hs.1 90787 
412970 AB026436 Hs.1 77534 
421379 Y15221 Hs.103982 
415752 BE314524 Hs.78776 
444051 N48373 Hs.10247 
451110 AI955040 Hs.265398 



androgen receptor (dihydrotestosterone r 
NIMA (never in mitosis gene a)-related k 
protein tyrosine phosphatase, receptor t 
small inducible cytokine subfamily A (Cy 
EGF-like-domain, multiple 6 
small inducible cytokine subfamily B (Cy 
paternally expressed 10 
kynurenine 3-monooxygenase (kynurenine 3 
preferentially expressed antigen in mela 
a disintegrin and metalloproteinase doma 
monokine induced by gamma interferon 
transforming growth factor, beta 2 
cartilage oligomeric matrix protein (pse 
gamma-aminobutyric acid (GABA) A recepto 
collagen, type XI, alpha 1 
ESTs 

cadherin 3, type 1 , P-cadherin (placenta 
tissue inhibitor of metalloproteinase 4 
dual specificity phosphatase 10 
small inducible cytokine subfamily B (Cy 
putative transmembrane protein 
activated leucocyte cell adhesion molecu 
ESTs, Weakly similar to transformation-r 



cytoplasm 
nuclear 



Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 

plasma membraneSeq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 



extracellular 



extracellular 



nuclear 

extracellular 
extracellular 
extracellular 



extracellular 



ID 93 & 94 
ID 95 & 96 
ID 97 & 98 
ID 99 & 100 
ID 101 & 102 
ID 103 & 104 
ID 105 & 106 
ID 107 & 108 
ID 109 & 110 
ID 111 & 112 
ID 113 & 114 
ID 115 & 116 
ID 117 & 118 
ID 119 & 120 
ID 121 & 122 
ID 123 & 124 
ID 125 & 126 
ID 127 & 128 
ID 129 & 130 
ID 131 & 132 
ID 133 & 134 
ID 135 & 136 
ID 137 & 138 
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TABLE 24A 

Table 24A shows the accession numbers for those pkeys lacking unigenelD's for Table 24. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

335824 CH22_3197FG_619_11_LINK_E 
325372 c12_hs 
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TABLE 24B 

Table 24B shows the genomic positioning for those pkeys lacking unigene DD's and 
accession numbers in Table 24. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1 999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand NLposition 

404561 9795980 Minus 69039-70100 
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Table 25 

The 69 gene sequences identified to be overexpressed in breast cancer may be used to identify coding regions from the public DNA databases (nr and htgs in 
Genbank). The sequences may be used to either identify genes that encode known proteins, or they may be used to predict the coding regions from genomic 
DNA using exon prediction algorithms, such as FGENESH (Salamov and Solovyev, 2000, Genome Res. 10:516-522) 



Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 1-1518 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGG AGCCCT CCTGGCTTCA GGAACTCATG GCTCACCCCT TCTTGCTGCT GATCCTCCTC 60 
TGCATGTCTC TGCTGCTGTT TCAGGTAATC AGGTTGTACC AGAGGAGGAG ATGGATGATC 120 
1 5 AGAGCCCTGC ACCTGTTTCC TGCACCCCCT GCCCACTGGT TCTATGGCCA CAAGGAGTTT 180 
TACCCAGTAA AGGAGTTTGA GGTGTATCAT AAGCTGATGG AAAAATACCC ATGTGCTGTT 240 
CCCTTGTGGG TTGGACCCTT TACGATGTTC TTCAGTGTCC ATGACCCAGA CTATGCCAAG 300 
ATTCTCCTGA AAAGACAAGA TCCCAAAAGT GCTGTTAGCC ACAAAATCCT TGAATCCTGG 360 
r * GTTGGTCGAG GACTTGTGAC CCTGGATGGT TCTAAATGGA AAAAGCACCG CCAGATTGTG 420 

[=20 AAACCTGGCT TCAACATCAG CATTCTGAAA ATATTCATCA CCATGATGTC TGAGAGTGTT 480 
r!.; CGGATGATGC TGAACAAATG GGAGGAACGC ATTGCCCAAA ACTCACGTCT GGAGCTCTTT 540 

%:J CAACATGTCT CCCTGATGAC CCTGGACAGC ATCATGAAGT GTGCCTTCAG CCACCAGGGC 600 

1 ?! AGCATCCAGT TGGACAGTAC CCTGGACTCA TACCTGAAAG CAGTGTTCAA CCTTAGCAAA 660 

""J. t? ATCTCCAACC AGCGCATGAA CAATTTTCTA CATCACAACG ACCTGGTTTT CAAATTCAGC 720 
. 25 TCTCAAGGCC AAATCTTTTC TAAATTTAAC CAAGAACTTC ATCAGTTCAC AGAGAAAGTA 780 
=7: ATCCAGGACC GGAAGGAGTC TCTTAAGGAT AAGCTAAAAC AAGATACTAC TCAGAAAAGG 840 

% CGCTGGGATT TTCTGGACAT ACTTTTGAGT GCCAAAAGCG AAAACACCAA AGATTTCTCT 900 

S! GAAGCAGATC TCCAGGCTGA AGTGAAAACG TTCATGTTTG CAGGACATGA CACCACATCC 960 

» AGTGCTATCT CCTGGATCCT TTACTGCTTG GCAAAGTACC CTGAGCATCA GCAGAGATGC 1020 

4=3 0 CGAGATGAAA TCAGGGAACT CCTAGGGGAT GGGTCTTCTA TTACCTGGGA ACACCTGAGC 1080 
CAGATGCCTT ACACCACGAT GTGCATCAAG GAATGCCTCC GCCTCTACGC ACCGGTAGTA 1 140 
.1- AACATATCCC GGTTACTCGA CAAACCCATC ACCTTTCCAG ATGGACGCTC CTTACCTGCA 1200 

D GGAATAACTG TGTTTATCAA TATTTGGGCT CTTCACCACA ACCCCTATTT CTGGGAAGAC 1260 

i _> CCTCAGGTCT TTAACCCCTT GAGATTCTCC AGGGAAAATT CTGAAAAAAT ACATCCCTAT 1320 

l3 5 GCCTTCATAC CATTCTCAGC TGGATTAAGG AACTGCATTG GGCAGCATTT TGCCATAATT 1380 
1 Li GAGTGTAAAG TGGCAGTGGC ATTAACTCTG CTCCGCTTCA AGCTGGCTCC AGACCACTCA 1440 

=■ AGGCCTCCCC AGCCTGTTCG TCAAGTTGTC CTCAAGTCCA AGAATGGAAT CCATGTGTTT 1 500 

=T : : GCAAAAAAAG TTTGC TAAT T TTAAGTCCTT TCGTATAAGA ATTAATGAG A CAATTTTCCT 1560 

iCl ACCAAAGGAA GAACAAAAGG ATAAATATAA TACAAAATAT ATGTATATGG TTGTTTGACA 1620 

=40 AATTATATAA CTTAGGATAC TTCTGACTGG TTTTGACATC CATTAACAGT AATTTTAATT 1680 
! y TCTTTGCTGT ATCTGGTGAA ACCCACAAAA ACACCTGAAA AAACTCAAGC TGACTTCCAC 1740 

TGCGAAGGGA AATTATTGGT TTGTGTAACT AGTGGTAGAG TGGCTTTCAA GCATAGTTTG 1 800 
ATCAAAACTC CACTCAGTAT CTGCATTACT TTTATCTCTG CAAATATCTG CATGATAGCT 1860 
TTATTCTCAG TTATCTTTCC CCAATAATAA AAAA 

45 



Seq ID NO: 2 Protein seauence: 

Protein Accession #: FGENESH predicted 

50 1 11 21 31 41 51 

I I i I I I 

MEPSWLQELM AHPFLLLILL CMSLLLFQV1 RLYQRRRWM1 RALHLFPAPP AHWFYGHKEF 60 
YPVKEFEVYH KLMEKYPCAV PLWVGPFTMF FSVHDPDYAK ILLKRQDPKS AVSHKILESW 120 
VGRGLVTLDG SKWKKHRQIV KPGFNISILK IFITMMSESV RMMLNKWEER IAQNSRLELF 1 80 

5 5 QHVSLMTLDS IMKCAFSHQG SIQLDSTLDS YLKAVFNLSK ISNQRMNNFL HHNDLVFKFS 240 
SQGQIFSKFN QELHQFTEKV IQDRKESLKD KLKQDTTQKR RWDFLDILLS AKSENTKDFS 300 
EADLQAEVKT FMFAGHDTTS SAISWILYCL AKYPEHQQRC RDEIRELLGD GSSITWEHLS 360 
QMPYTTMCDC ECLRLYAPVV NISRLLDKPI TFPDGRSLPA GITVFINIWA LHHNPYFWED 420 
PQVFNPLRFS RENSEKIHPY AFIPFSAGLR NCIGQHFAU ECKVAVALTL LRFKLAPDHS 480 

60 RPPQPVRQW LKSKNGIHVF AKKVC 



65 



Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: NM_052997 

Coding sequence: 100-4125 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

! i I ! I I 

CTAGTCTATA CCAGCAACGA CTCCTACATC GTCCACTCTG GGGATCTTAG AAAGATCCAT 60 
70 AAAGCTGCCT CCCGGGGACA AGTCCGGAAG CTGGAGAAG A TG ACAAAGAG GAAGAAGACC 120 
ATCA ACCTTA ATATAC A AG A CGCCC AG AAG AGGACTGCTC TAC ACTGGGC CTGTGTCAAT 1 80 
GGCCATGAGG AAGTAGTAAC ATTTCTGGTA GACAGAAAGT GCCAGCTTGA CGTCCTTGAT 240 
GGCGAACACA GGACACCTCT GATGAAGGCT CTACAATGCC ATCAGGAGGC TTGTGCAAAT 300 
ATTCTGATAG ATTCTGGTGC CGATATAAAT CTCGTAGATG TGTATGGCAA CATGGCTCTC 360 
75 CATTATGCTG TTTATAGTGA GATTTTGTCA GTGGTGGCAA AACTGCTGTC CCATGGTGCA 420 
GTCATCGAAG TGCACAACAA GGCTAGCCTC ACACCACTTT TACTATCCAT AACGAAAAGA 480 
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AGTGAGCAAA TTGTGGAATT TTTGCTGATA AAAAATGCAA ATGCGAATGC AGTTAATAAG 540 
TATAAATGCA CAGCCCTCAT GCTTGCTGTA TGTCATGGAT CATCAGAGAT AGTTGGCATG 600 
CTTCTTCAGC AAAATGTTGA CGTCTTTGCT GCAGATATAT GTGGAGTAAC TGCAGAACAT 660 
TATGCTGTTA CTTGTGGATT TCATCACATT CATGAACAAA TTATGGAATA TATACGAAAA 720 
TTATCTAAAA ATCATCAAAA TACCAATCCA GAAGGAACAT CTGCAGGAAC ACCTGATGAG 780 
GCTGCACCCT TGGCGGAAAG AACACCTGAC AC AGCTG AAA GCTTGGTGGA AAAAACACCT 840 
GATGAGGCTG CACCCTTGGT GGAAAGAACA CCTGACACGG CTGAAAGCTT GGTGGAAAAA 900 
ACACCTGATG AGGCTGCATC CTTGGTGGAG GGAACATCTG ACAAAATTCA ATGTTTGGAG 960 
AAAGCGACAT CTGGAAAGTT CGAACAGTCA GCAGAAGAAA CACCTAGGGA AATTACGAGT 1020 
CCTGCAAAAG AAACATCTGA GAAATTTACG TGGCCAGCAA AAGGAAGACC TAGGAAGATC 1080 
GCATGGGAGA AAAAAGAAGA CACACCTAGG GAAATTATGA GTCCCGCAAA AGAAACATCT 1 140 
GAGAAATTTA CGTGGGCAGC AAAAGGAAGA CCTAGGAAGA TCGCATGGGA GAAAAAAGAA 1200 
ACACCTGTAA AGACTGGATG CGTGGCAAGA GTAACATCTA ATAAAACTAA AGTTTTGGAA 1260 
AAAGGAAGAT CTAAGATGAT TGCATGTCCT ACAAAAGAAT CATCTACAAA AGCAAGTGCC 1320 
AATGATCAGA GGTTCCCATC AGAATCCAAA CAAGAGGAAG ATGAAGAATA TTCTTGTGAT 1380 
TCTCGGAGTC TCTTTGAGAG TTCTGCAAAG ATTCAAGTGT GTATACCTGA GTCTATATAT 1440 
CAAAAAGTAA TGGAGATAAA TAGAGAAGTA GAAGAGCCTC CTAAGAAGCC ATCTGCCTTC 1500 
AAGCCTGCCA TTGAAATGCA AAACTCTGTT CCAAATAAAG CCTTTGAATT GAAGAATGAA 1560 
CAAACATTGA GAGCAGATCC GATGTTCCCA CCAGAATCCA AACAAAAGGA CTATGAAGAA 1620 
AATTCTTGGG ATTCTGAGAG TCTCTGTGAG ACTGTTTCAC AGAAGGATGT GTGTTTACCC 1680 
AAGGCTACAC ATCAAAAAGA AATAGATAAA ATAAATGGAA AATTAGAAGA GTCTCCTAAT 1740 
AAAGATGGTC TTCTGAAGGC TACCTGCGGA ATGAAAGTTT CTATTCCAAC TAAAGCCTTA 1800 
GAATTGAAGG ACATGCAAAC TTTCAAAGCG GAGCCTCCGG GGAAGCCATC TGCCTTCGAG 1860 
CCTGCCACTG AAATGCAAAA GTCTGTCCCA AATAAAGCCT TGGAATTGAA AAATGAACAA 1920 
ACATGGAGAG CAGATGAGAT ACTCCCATCA GAATCCAAAC AAAAGGACTA TGAAGAAAAT 1980 
TCTTGGGATA CTGAGAGTCT CTGTGAGACT GTTTCACAGA AGGATGTGTG TTTACCCAAG 2040 
GCTGCGCATC AAAAAGAAAT AGATAAAATA AATGGAAAAT TAGAAGGGTC TCCTGTTAAA 2100 
GATGGTCTTC TGAAGGCTAA CTGCGGAATG AAAGTTTCTA TTCCAACTAA AGCCTTAGAA 2160 
TTGATGGACA TGCAAACTTT CAAAGCAGAG CCTCCCGAGA AGCCATCTGC CTTCGAGCCT 2220 
GCCATTGAAA TGCAAAAGTC TGTTCCAAAT AAAGCCTTGG AATTGAAGAA TGAACAAACA 2280 
TTGAGAGCAG ATGAGATACT CCCATCAGAA TCCAAACAAA AGGACTATGA AGAAAGTTCT 2340 
TGGGATTCTG AGAGTCTCTG TGAGACTGTT TCACAGAAGG ATGTGTGTTT ACCCAAGGCT 2400 
ACACATCAAA AAGAAATAGA TAAAATAAAT GGAAAATTAG AAGAGTCTCC TGATAATGAT 2460 
GGTTTTCTGA AGGCTCCCTG CAGAATGAAA GTTTCTATTC CAACTAAAGC CTT AGAATTG 2520 
ATGGACATGC AAACTTTCAA AGCAGAGCCT CCCGAGAAGC CATCTGCCTT CGAGCCTGCC 2580 
ATTGAAATGC AAAAGTCTGT TCCAAATAAA GCCTTGGAAT TGAAGAATGA ACAAACATTG 2640 
AGAGCAGATC AGATGTTCCC TTCAGAATCA AAACAAAAGA AGGTTGAAGA AAATTCTTGG 2700 
GATTCTGAGA GTCTCCGTGA GACTGTTTCA CAGAAGGATG TGTGTGTACC CAAGGCTACA 2760 
CATCAAAAAG AAATGGATAA AATAAGTGGA AAATTAGAAG ATTCAACTAG CCTATCAAAA 2820 
ATCTTGGATA CAGTTCATTC TTGTGAAAGA GCAAGGGAAC TTCAAAAAGA TCACTGTGAA 2880 
CAACGTACAG GAAAAATGGA ACAAATGAAA AAGAAGTTTT GTGTACTGAA AAAGAAACTG 2940 
TCAGAAGCAA AAGAAATAAA ATCACAGTTA GAGAACCAAA AAGTTAAATG GGAACAAGAG 3000 
CTCTGCAGTG TGAGATTGAC TTTAAACCAA GAAGAAGAGA AGAGAAGAAA TGCCGATATA 3060 
TTAAATGAAA AAATTAGGGA AGAATTAGGA AGAATCGAAG AGCAGCATAG GAAAGAGTTA 3120 
GAAGTGAAAC AACAACTTGA ACAGGCTCTC AGAATACAAG ATATAGAATT GAAGAGTGTA 3180 
GAAAGTAATT TGAATCAGGT TTCTCACACT CATGAAAATG AAAATTATCT CTTACATGAA 3240 
AATTGCATGT TGAAAAAGGA AATTGCCATG CTAAAACTGG AAATAGCCAC ACTGAAACAC 3300 
CAATACCAGG AAAAGGAAAA TAAATACTTT GAGGACATTA AGATTTTAAA AGAAAAGAAT 3360 
GCTGAACTTC AGATGACCCT AAAACTGAAA GAGGAATCAT TAACTAAAAG GGCATCTCAA 3420 
TATAGTGGGC AGCTTAAAGT TCTGATAGCT GAGAACACAA TGCTCACTTC TAAATTGAAG 3480 
GAAAAACAAG ACAAAGAAAT ACTAGAGGCA GAAATTGAAT CACACCATCC TAGACTGGCT 3540 
TCTGCTGTAC AAGACCATGA TCAAATTGTG ACATCAAGAA AAAGTCAAGA ACCTGCTTTC 3600 
CACATTGCAG GAGATGCTTG TTTGCAAAGA AAAATGAATG TTGATGTGAG TAGTACGATA 3660 
TATAACAATG AGGTGCTCCA TCAACCACTT TCTGAAGCTC AAAGGAAATC CAAAAGCCTA 3720 
AAAATTAATC TCAATTATGC AGGAGATGCT CTAAGAGAAA ATACATTGGT TTCAGAACAT 3780 
GCACAAAGAG ACCAACGTGA AACACAGTGT CAAATGAAGG AAGCTGAACA CATGTATCAA 3840 
AACGAACAAG ATAATGTGAA CAAACACACT GAACAGCAGG AGTCTCTAGA TCAGAAATTA 3900 
TTTCAACTAC AAAGCAAAAA TATGTGGCTT CAACAGCAAT TAGTTCATGC ACATAAGAAA 3960 
GCTGACAACA AAAGC AAG AT AACAATTGAT ATTCATTTTC TTGAGAGGAA AATGCAACAT 4020 
CATCTCCTAA AAGAGAAAAA TGAGGAGATA TTTAATTACA ATAACCATTT AAAAAACCGT 4080 
ATATATCAAT ATGAAAAAGA GAAAGCAGAA ACAGAAAACT CA TGA GAGAC AAGCAGTAAG 4140 
AAACTTCTTT TGGAGAAACA ACAGACCAGA TCTTTACTCA CAACTCATGC TAGGAGGCCA 4200 
GTCCTAGCAT CACCTTATGT TGAAAATCTT ACCAATAGTC TGTGTCAACA GAATACTTAT 4260 
TTTAGAAGAA AAATTCATGA TTTCTTCCTG AAGCCTACAG ACATAAAATA ACAGTGTGAA 4320 
GAATTACTTG TTCACGAATT GCATAAAGCT GCACAGGATT CCCATCTACC CTGATGATGC 4380 
AGCAGACATC ATTCAATCCA ACCAGAATCT CGCTCTGCAC TCCAGCCTAG GTGACAGAGT 4440 
GAGACTCCAC CTCGGAAA 



Seq ID NO: 4 Protein sequence: 
Protein Accession #: NP_443723 . 1 

1 11 21 31 41 51 

MTKRKKTINL NIQDAQKRTALHWACVNGHE EVVTFLVDRK CQLDVLDGEH RTPLMKALQC 60 
HQEACANILI DSGADINLVD VYGNMALHYA VYSEILSV VA KLLSHGAVIE VHNKASLTPL 1 20 
LLSITKRSEQ IVEFLLIKNA NANAVNKYKC TALMLAVCHG SSEIVGMLLQ QNVDVFAADI 180 
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CGVTAEHYAV TCGFHHIHEQ IMEYIRKLSK NHQNTNPEGT SAGTPDEAAP LAERTPDTAE 240 
SLVEKTPDEA APLVERTPDT AESLVEKTPD EAASLVEGTS DKIQCLEKAT SGKFEQSAEE 300 
TPREITSPAK ETSEKFTWPA KGRPRKIAWE KKEDTPREIM SPAKETSEKF TWAAKGRPRK 360 
IAWEKKETPV KTGCVARVTS NKTKVLEKGR SKMIACPTKE SSTKASANDQ RFPSESKQEE 420 
DEEYSCDSRS LFESSAKIQV CIPESIYQKV MEINREVEEP PKKPSAFKPA IEMQNSVPNK 480 
AFELKNEQTL RADPMFPPES KQKDYEENSW DSESLCETVS QKDVCLPKAT HQKEIDKJNG 540 
KLEESPNKDG LLKATCGMKV SIPTKALELK DMQTFKAEPP GKPSAFEPAT EMQKSVPNKA 600 
LELKNEQTWR ADEILPSESK QKDYEENSWD TESLCETVSQ KDVCLPKAAH QKEIDKINGK 660 
LEGSPVKDGL LKANCGMKVS IPTKALELMD MQTFKAEPPE KPSAFEPAIE MQKSVPNKAL 720 
ELKNEQTLRA DEILPSESKQ KDYEESSWDS ESLCETVSQK DVCLPKATHQ K.EIDKINGK.L 780 
EESPDNDGFL KAPCRMKVSI PTKALELMDM QTFKAEPPEK PSAFEPAIEM QKSVPNKALE 840 
LKNEQTLRAD QMFPSESKQK KVEENSWDSE SLRETVSQKD VCVPKATHQK EMDKISGKLE 900 
DSTSLSKILD TVHSCERARE LQKDHCEQRT GKMEQMKKKF CVLKKKLSEA KEKSQLENQ 960 
KVKWEQELCS VRLTLNQEEE KRRNADILNE KIREELGRIE EQHRKELEVK QQLEQALRIQ 1020 
DIELKSVESN LNQVSHTHEN ENYLLHENCM LKKEIAMLKL EIATLKHQYQ EKENKYFEDI 1080 
KILKEKNAEL QMTLKLKEES LTKRASQYSG QLKVLIAENT MLTSKLKEKQ DKEILEAEIE 1 140 
SHHPRLASAV QDHDQIVTSR KSQEPAFHIA GDACLQRKMN VDVSSTIYNN EVLHQPLSEA 1200 
ORKSKSLKIN LNYAGDALRE NTLVSEHAQR DQRETQCQMK EAEHMYQNEQ DNVNKHTEQQ 1260 
ESLDQKLFQL QSKNMWLQQQ LVHAHKKADN KSKITIDIHF LERKMQHHLL KEKNEEIFNY 1320 
NNHLKNRIYQ YEKEKAETEN S 

Seq ID NO: 5 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 273-1785 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60 
GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 
GTG AAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 1 80 
CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAAATTA 300 
AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 
TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420 
GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 
GGTTGCCTAG GACTAGAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 
AGAAGATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600 
CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660 
ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 
TATAAAAGAC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAGAACAGGA TGAAACTTAC 780 
ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 
GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1140 
TATCTGAAGT CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 
CATCGAGATC TGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 
ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 
AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1500 
GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 
CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 
CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680 
ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800 
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 
TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 
CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980 
TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 



Seq ID NO: 6 Protein sequence: 
Protein Accession #: none found 



1 11 21 31 41 51 

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNN1CSTD GYCFTMIEED 60 
DSGLPVVTSG CLGLEGSDFQ CRDTPWHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120 
GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 1 80 
EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 
WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 
MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360 
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420 
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EEYQLPYHDL VPSDPSYEDM REIVCKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDI KL 

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 482-3007(underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60 
TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 120 
TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 1 80 
ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 240 
1 5 CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 

CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 
GGTGTTTTCT TTTTTTT A AT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 
ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 
CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 
1=^0 CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 
i;=. AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 

W TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720 

f-i CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 

TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 
S-15 TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900 
pw. CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960 

^ CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 

fU CCGATTTGTT CCTTTAACCC ATCTAGATCT TCGTGGAAAT CAATTACAAA CATTGCCTTA 1080 

I ", TGTTGGTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1 140 

' S 0 GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 
f~j TAT AATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGG AAGTA TACTCAGTAG 1 260 

ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 1320 
5 AGGATCATT A CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1 380 

t~> CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 

"35 ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500 
F* ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 1560 

ff i TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 

*i GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 

£ AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 1740 

£40 TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1 800 
W ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1 860 

SI \ ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920 

" AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 

TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040 
45 CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 
GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 
AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 
TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 
TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 
5 0 TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGGA GAAGATACAA 2400 

AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 
GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC TCTATGAACA 2520 
GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 
GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 
5 5 TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700 
GAACCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760 
AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 
TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 
GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 
60 TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 
AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060 
ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 
AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 

65 



Seq ID NO: 8 Protein sequence: 
Protein Accession #: none found 

70 1 11 21 31 41 51 

MKLWIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMLINCEA KGIKMVSEIS 60 
VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNI ADIEIGAFNG LGLLKQLHIN 120 
HNSLEILKED TFHGLENLEF LQADNNFITV IEPSAFSKLN RLKVLILNDN AIESLPPNIF 1 80 
75 RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240 

IIGDVVCNSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 300 
TSILKLPTKA PGLIPYITKP STQLPGPYCP IPCNCKVLSP SGLLIHCQER NIESLSDLRP 360 
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PPQNPRKLIL AGNIIHSLMK SDLVEYFTLE MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 420 
NHLTKLSKGM FLGLHNLEYL YLEYNADCEI LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 480 
GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540 
TDDILCTSPG HLDKKELKAL NSEILCPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600 
LTDAVPLSVL ILGLLIMFIT IVFCAAGIVV LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM 660 
YGHKTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 720 
LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 780 
QLQPDMEAHY PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 
T 

Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #: NM_003474 

Coding sequence.- 307-3036 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120 
CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT 180 
CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 
GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 
GCGACG ATG G CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 
GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 
CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 
TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 
TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 
TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 
TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGG AGCTG 960 
GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 
CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080 
ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1 140 
CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 
TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 
GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 
GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 
ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 
GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1620 
GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 
AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 
GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800 
AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1 860 
GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 
CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2040 
AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 
GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 
CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 
GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGG CAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 
GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAG CACAGACAGC 2400 
GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520 
TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 
CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAG 2640 
CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGGAGAT TGCTGCAGTG TCAGAATGTT 2700 
GACATCAGCA GACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760 
CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820 
AAGCCTOCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 
GTGCCCAGAT CCACCCACAC CGCCTATATT AAG TGAG AAG CCGACACCTT TTTTCAACAG 3060 
TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 
GGATTTTCTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTGAGCTAC TGCCGTCAGT 3180 
GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 
GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 
CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 
ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 
CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 
CTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 
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ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3780 
GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840 
CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 
CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 
CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020 
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 
TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 
AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500 
CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 
TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 
AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 
TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 
ACCAAAAAAA AAAAAAAAAA AA 



Seq ID NO: 10 Protein sequence: 
Protein Accession #: NPO03465.2 

1 11 21 31 41 51 

I I I I I I 

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EWSASVRSG DLWIPVKSFD 60 
SKNHPEVLNI RLQRESKELI INLERNEGLI ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120 
YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 180 
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 
LIEIANHVDK FYRPLNIR1V LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 
DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360 
DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 
SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 
ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540 
AKPAPG1CFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 
IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKICLNRQCQ NISVFGVHEC 660 
AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720 
LAAGFWYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 
ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYHC 

Seq ID NO: 1 1 DNA sequence 

Nucleic Acid Accession #: NM_001394 

Coding sequence: 400-1584(underIined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGG CTGAGCGCCG GAGGAGCGTA GGCAGGGCAG CGCTGGCGCC AGCGGCGACA 60 
GGAGCCGCGC GACCGGCAAA AATACACGGG AGGCCGTCGC CGAAAAGAGT CCGCGGTCCT 120 
CTCTCGTAAA CACACTCTCC TCCACCGGCG CCTCCCCCTC CGCTCTGCGC GCCGCCCGGC 180 
TGGGCGCCCG AGGCCGCTCC GACTGCTATG TGACCGCGAG GCTGCGGGAG GAAGGGGACA 240 
GGGAAGAAGA GGCTCTCCCG CGGGAGCCCT TGAGGACCAA GTTTGCGGCC ACTTCTGCAG 300 
GCGTCCCTTC TTAGCTCTCG CCTGCCCCTT TCTGCAGCCT AGGCGGCCCA GGTTCTCTTC 360 
TCTTCCTCGC GCGCCCAGCC GCCTCGGTTC CCGGCGACC A TG GTGACGAT GGAGGAGCTG 420 
CGGGAGATGG ACTGCAGTGT GCTCAAAAGG CTGATGAACC GGGACGAGAA TGGCGGCGGC 480 
GCGGGCGGCA GCGGCAGCCA CGGCACCCTG GGGCTGCCGA GCGGCGGCAA GTGCCTGCTG 540 
CTGGACTGCA GACCGTTCCT GGCGCACAGC GCGGGCTACA TCCTAGGTTC GGTCAACGTG 600 
CG CTGTA AC A CCATCGTGCG GCGGCGGGCT AAGGGCTCCG TGAGCCTGGA GCAGATCCTG 660 
CCCGCCGAGG AGGAGGTACG CGCCCGCTTG CGCTCCGGCC TCTACTCGGC GGTCATCGTC 720 
TACGACGAGC GCAGCCCGCG CGCCGAGAGC CTCCGCGAGG ACAGCACCGT GTCGCTGGTG 780 
GTGCAGGCGC TGCGCCGCAA CGCCGAGCGC ACCGACATCT GCCTGCTCAA AGGCGGCTAT 840 
GAGAGGTTTT CCTCCGAGTA CCCAGAATTC TGTTCTAAAA CCAAGGCCCT G GCAGCCATC 900 
CCACCCCCGG TTCCCCCCAG CGCCACAGAG CCCTTGGACC TGGGCTGCAG CTCCTGTGGG 960 
ACCCCACTAC ACGACCAGGG GGGTCCTGTG GAGATCCTTC CCTTCCTCTA CCTCGGCAGT 1020 
GCCTACCATG CTGCCCGGAG AGACATGCTG GACGCCCTGG GCATCACGGC TCTGTTGAAT 1080 
GTCTCCTCGG ACTGCCCAAA CCACTTTGAA GGACACTATC AGTACAAGTG CATCCCAGTG 1 140 
GAAGATAACC ACAAGGCCGA CATCAGCTCC TGGTTCATGG AAGCCATAGA GTACATCGAT 1200 
GCCGTGAAGG ACTGCCGTGG GCGCGTGCTG GTGCACTGCC AGGCGGGCAT CTCGCGGTCG 1260 
GCCACCATCT GCCTGGCCTA CCTGATGATG AAGAAACGGG TGAGGCTGGA GGAGGCCTTC 1320 
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GAGTTCGTTA AGCAGCGCCG CAGCATCATC TCGCCCAACT TCAGCTTCAT GGGGCAGCTG 1380 
CTGCAGTTCG AGTCCCAGGT GCTGGCCACG TCCTGTGCTG CGGAGGCTGC TAGCCCCTCG 1440 
GGACCCCTGC GGGAGCGGGG CAAGACCCCC GCCACCCCCA CCTCGCAGTT CGTCTTCAGC 1500 
TTTCCGGTCT CCGTGGGCGT GCACTCGGCC CCCAGCAGCC TGCCCTACCT GCACAGCCCC 1560 
5 ATCACCACCT CTCCCAGCTG TTAGAGCCGC CCTGGGGGCC CCAGAACCAG AGCTGGCTCC 1620 

CAGCAAGGGT AGGACGGGCC GCATGCGGCA GAAAGTTGGG ACTGAGCAGC TGGGAGCAGG 1680 
CGACCGAGCT CCTTCCCCAT CATTTCTCCT TGGCCAACGA CGAGGCCAGC CAGAATGGCA 1740 
ATAAGGACTC CGAATACATA ATAAAAGCAA ACAGAACACT CCAACTTAGA GCAATAACCG 1800 
GTGCCGCAGC AGCCAGGGAA GACCTTGGTT TGGTTTATGT GTCAGTTTCA CTTTTCCGAT 1860 

1 0 AGAAATTTCT TACCTCATTT TTTTAAGCAG TAAGGCTTGA AGTGATGAAA CCCACAGATC 1920 

CTAGCAAATG TGCCCAACCA GCTTTACTAA AGGGGGAGGA AGGGAGGGCA AAGGGATGAG 1980 
AAGACAAGTT TCCCAGAAGT GCCTGGTTCT GGGTACTTGT CCCTTTGTTG TCGTTGTTGT 2040 
AGTTAAAGGA ATTTCATTTT TAAAAGAAAT CTTCGAAGGT GTGGTTTTCA TTTCTCAGTC 2100 
ACCAACAGAT GAATAATTAT GCTTAATAAT AAAGTATTTA TTAAGACTTT CTTCAGAGTA 2160 

1 5 TGAAAGTACA AAAAGTCTAG TTACAGTGGA TTTAGAATAT ATTTATGTTG ATGTCAAACA 2220 
GCTGAGCACC GTAGCATGCA GATGTCAAGG CAGTTAGGAA GTAAATGGTG TCTTGTAGAT 2280 
ATGTGCAAGG TAGCATGATG AGCAACTTGA GTTTGTTGCC ACTGAGAAGC AGGCGGGTTG 2340 
GGTGGGAGGA GGAAGAAAGG GAAGAATTAG GTTTGAATTG CTTTTTAAAA AAAAAAGAAA 2400 
AGAAAAAGAC AGCATCTCAC TATGTTGCCA AGGCTCATCT TGAGAAGCAG GCGGGTTGGG 2460 
hSO TGGGAGGAGG AAGAAAGGGA AGAATTAGGT TTGAATTGCT TTTTTAAAAA A AAA 

Seq ID NO: 12 Protein sequence: 
Cl Protein Accession*: NP_001385 

W25 1 11 21 31 41 51 

S\ MVTMEELREM DCSVLKRLMN RDENGGGAGG SGSHGTLGLP SGGKCLLLDC RPFLAHSAGY 60 

W ILGSVNVRCN TIVRRRAKGS VSLEQILPAE EEVRARLRSG LYSAVIVYDE RSPRAESLRE 120 

% j DSTVSLVVQA LRRNAERTDI CLLKGGYERF SSEYPEFCSK TKALAAIPPP VPPSATEPLD 180 

3 0 LGCSSCGTPL HDQGGPVEIL PFLYLGSAYH AARRDMLDAL GITALLNVSS DCPNHFEGHY 240 

O QYKCIPVEDN HKADISSWFM EAIEYIDAVK DCRGRVLVHC QAGISRSATI CLAYLMMKKR 300 

VRLEEAFEFV KQRRSIISPN FSFMGQLLQF ESQVLATSCA AEAASPSGPL RERGKTPATP 360 

L_ TSQFVFSFPV SVGVHSAPSS LPYLHSPITT SPSC 

735 Seq ID NO: 13 DNA sequence 

t~* Nucleic Acid Accession #: none found 

T\\ Coding sequence: 68-340(underlined sequences correspond to start and stop codons) 

MO 1 11 21 31 41 51 

T'. I I I 1 I 1 

s U AGCGCCTTGC CTTCTCTTAG GCTTTGAAGC ATTTTTGTCT GTGCTCCCTG ATCTTCAGGT 60 

CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 1 20 
TGCCCAGAAT CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180 
45 TGATGAAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA CTGCTGCTCC 240 
TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TTTTACCCAA 300 
ATGGGTTGGG GATCTCCCGA ATGGTAGAGT GTGTCCCTGA GATGGAATCA GCTTGAGTCT 360 
TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTGC 420 
CTACGATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 
50 GCGAGCTAAC AT 
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Seq ID NO: 14 Protein sequence: 
Protein Accession # : none found 



1 11 21 31 41 51 

MKFLAVLVLL GVSIFLVSAQ NPTTAAPADT YPATGPADDE APDAETTAAA TTATTAAPTT 60 
60 ATTAASTTAR KDIPVLPKWV GDLPNGRVCP 



Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: NM 0 1 6640.2 

Coding sequence: 39-1358(underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

GCTTAAGTTG ACCTCTGGGT CCGGAATCGC GGGCAAAGAT GGCGGCGGCC AGGTGTTGGA 60 
70 GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120 
CTACAGAAAC GACCTCCCAA GACGTCGCGG CGACCCCCGT CGCGCGGTAC CCGCCGATTG 1 80 
TGGCCTCCAT GACAGCCGAC AGCAAAGCTG CACGGCTGCG GCGGATCGAG CGCTGGCAGG 240 
CGACGGTGCA CGCTGCG GAG TCGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 
TTATGAAGTA CATGGTTTAC CCGCAGACCT TCGCACTGAA TGCCGACCGC TGGTACCAGT 360 
75 ACTTCACCAA GACCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCCG 420 
AGCCCGAACC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCCTGCG 480 
ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCGCAGGCG GCGCGTGCAC CGTTACGAGG 540 
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AGAGCGAGGT CATATCTTTG CCCTTCCTGG ATCAGCTGGT GTCAACCCTC GTGGGCCTCC 600 
TCAGCCCACA CAACCCGGCC CTGGCCGCTG CCGCCCTCGA TTATAGATGC CCAGTTCATT 660 
TTTACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCGAAGAGGT CGAATTGATG 720 
ACTTGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 780 
CAGAGTTTGT GCCATTGGAT TATTCTGTTC CTATAGAAAT CCCCACTATA AAATGTAAAC 840 
CAGACAAACT TCCATTATTC AAACGGCAGT ATGAAAACCA CATATTTGTT GGCTCAAAAA 900 
CTGCAGATCC TTGCTGTTAC GGTCACACCC AGTTTCATCT GTTACCTGAC AAATTAAGAA 960 
GGGAAAGGCT TTTGAGACAA AACTGTGCTG ATCAGATAGA AGTTGTTTTT AGAGCTAATG 1020 
CTATTGCAAG CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1080 
AAGCAGATGT TACTCGACCT TTTGTCTCCC AGGCTGTGAT CACAGATGGA AAATACTTTT 1 140 
CCTTTTTCTG CTACCAGCTA AATACTTTGG CACTGACTAC ACAAGCTGAT CAAAATAACC 1200 
CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1260 
ATGATGTGAA AGGTTTTAAT GATGATGTTC TACTTCAGAT AGTTCACTTT CTACTGAATA 1320 
GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAA CTGA AA AAGCATATTT GATTGAGAAC 1380 
TGTGGGAATA TTTAAATTTT ACTGAAGGAA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AGACAAATAT TTCTTATGTC AACCTGTTAT TAGATCTCTT 1500 
ACTCTGCTCA AATTCATCAC TGAAAGATTT AATTTTAGTT ACCTTTTGTT GATTTAAAAA 1560 
TAATTGCATT TGTATATTGC TAACTGATAA GACAAATTGA GTTATTGAGC TATTAAATGC 1620 
ACATTTTAAT ATAAATGCAG AAATCCCAAA TAAAATGCTA ACATACTGAA TTCAGTAATT 1680 
AAAAGAACCC ACTGC 



Seq ID NO: 16 Protein sequence: 
Protein Accession #: NP 0S7724. 1 

1 11 21 31 41 51 

MAAARCWRPL LRGPRLSLHT AANAAATATE TTSQDVAATP VARYPPIVAS MTADSKAARL 60 
RRIERWQATV HAAESVDEKL RILTKMQFMK YMVYPQTFAL NADRWYQYFT KTVFLSGLPP 1 20 
PPAEPEPEPE PEPEPALDLA ALRAVACDCL LQEHFYLRRR RRVHRYEESE VISLPFLDQL 1 80 
VSTLVGLLSP HNPALAAAAL DYRCPVHFYW VRGEEIIPRG HRRGRIDDLR YQIDDKPNNQ 240 
IRISKQLAEF VPLDYSVPIE IPTIKCKPDK LPLFKRQYEN FUFVGSKTAD PCCYGHTQFH 300 
LLPDKLRRER LLRQNCADQI EVVFRANAIA SLFAWTGAQA MYQGFWSEAD VTRPFVSQAV 360 
ITDGKYFSFF CYQLNTLALT TQADQNNPRK NICWGTQSKP LYETIEDNDV KGFNDDVLLQ 420 
IVHFLLNRPK EEKSQLLEN 

Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession #: NM_025059.1 

Coding sequence: 3-2150 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

GCATGAGCCT GGACTGCACC AGCCATATCG CGCTGGGTGC CGCTTCGCCA GCGCCCGAGG 60 
AAACTTACGA TCATCTTTCG GAAGTCCCGG TCACGCGGGA GCAGTTAAAC CACTATCGGA 120 
ATGTGGCTCA AAATGCTCGA AGTGAACTTG CAGCAACTTT GGTCAAATTT GAATGTGCTC 180 
AGTCTGAGCT TCAAGACCTC CGATCCAAGA TGCTCTCTAA AGAAGTCTCC TGTCAAGAAC 240 
TGAAAGCTGA AATGGAGAGC TACAAGGAAA ACAATGCCAG AAAATCATCT CTCCTTACCT 300 
CTTTGAGAGA CAGAGTTCAG GAACTAGAAG AAGAATCAGC AGCACTTTCC ACTTCTAAAA 360 
TCAGAACAGA AATCACAGCT CACGCTGCAA TCAAGGAGAA CCAGGAATTA AAGAAGAAAG 420 
TTGTAGAGTT AAATGAAAAA TTACAAAAGT GTTCAAAAGA AAATGAGGAG AATAAGAAAC 480 
AAGTTTCAAA GAATTGCAGG AAACATGAGG AATTTCTGAC TCAACTGCGT GACTGCTTGG 540 
ATCCAGATGA GAGGAATGAC AAGGCATCAG ATGAAGATTT AATTTTAAAG CTTAGAGACC 600 
TGCGCAAAGA AAATGAATTC GTGAAAGGAC AAATTGTTAT TCTTGAAGAG ACTATAAATG 660 
TCCATGAGAT GGAAGCAAAA GCTAGCAGAG AAACGATCAT GAGGCTGGCT TCAGAAGTCA 720 
ACAGAGAGCA GAAAAAAGCT GCCTCCTGTA CTGAAGAGAA AGAGAAGCTG AACCAGGACC 780 
TGCTCAGTGC TGTAGAAGCA AAAGAAGCTC TTGAAAGGGA AGTTAAGATC TTCCAAGAAA 840 
GGCTGCTTGC TGGCCAGCAG GTCTGGGATG CCTCAAAGCA GGAAGTGAGC CTCCTGAAGA 900 
AAAGCTCTTC TGAGTTGGAG AAGAGTTTGA AGGCCAGTCA GGATGCAGTC ACAACCTCAC 960 
AAAGCCAGTA CTCCTCATTT AGGGAGAAAA TCGCAGCCCT CCTTAGGGGC AGATTGAGCA 1020 
TGACTGGGTC CACTGAGGAC ACCATTTTGG AGAAGATTCG AGAAATGGAC AGCCGGGAAG 1080 
AAAGCAGGGA CCGGATGGTC TCCCAGCTTG AAGCCCAAAT ATCTGAGCTT GTTGAACAGT 1 140 
TGGGAAAGGA GTCTGGGTTT CACCAGAAAG CTCTCCAGAG GGCCCAGAAA GCAGAGAATA 1200 
TGTTGGAGAC TCTTCAGGGT CAGCTGACAC ACCTGGAGGC AGAGCTGGTT TCTGGAGGTG 1260 
TTTTGCGAGA CAACTTGAAT TTTGAGAAAC AAAAATATCT TAAATTTCTG GATCAGCTTT 1320 
CTCAGAAAAT GAAGTTGGAC CAGATGGCTG CCGAACTTGG CTTTGACACG CGGCTGGACG 1380 
TGGTTTTAGC TCGAACAGAG CAGCTGGTTC GTCTTGAGAG CAATGCAGTC ATTGAGAACA 1440 
AGACCATTGC CCACAATTTG CAGAGAAAGC TAAAGACACA GAAAGAGAGA CTGGAGAGCA 1500 
AAGAATTACA CATGAGCCTC CTCCGGCAGA AAATAGCCCA GCTGGAGGAG GAGAAGCAGG 1560 
CACGCACGGC CTTGGTGGTT GAGAGGGACA ACGCGCATCT TACCATCAGG AACTTGCAGA 1620 
AGAAGGTGGA GAGGCTGCAG AAAGAGCTGA ACACGTGTCG AGACTTGCAC ACCGAGCTCA 1680 
AAGCCAAACT GGCCGACACC AATGAACTGA AGATTAAAAC TTTGGAACAG ACTAAAGCCA 1740 
TTGAAGATCT AAACAAATCC AGAGACCAAC TGGAGAAGAT GAAGGAGAAA GCTGAGAAAA 1800 
AGCTCATGTC TGTCAAGTCA GAACTGGATA CCACAGAACA TGAGGCTAAG GAGAATAAAG 1860 
AAAGGGCCAG AAACATGATA GAAGTGGTAA CCAGTGAAAT GAAGACACTA AAAAAATCTC 1920 
TGGAAGAAGC AGAAAAGAGA GAAAAGCAGC TGGCAGACTT CAGGGAGGTG GTGTCGCAGA 1980 
TGCTAGGCTT GAACGTGACC AGCCTTGCTC TTCCTGATTA TGAAATCATC AAGTGTCTTG 2040 
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AAAGATTGGT CCATTCACAT CAGCATCACT TTGTTACCTG TGCCTGCCTC AAAGATGTGA 2100 
CTACTGG GCA AGAGAGGCAC CCACAAGGCC ATTTACAGCT TCTTCATTGA ACACTGTATC 2160 
TCTTGAGAGA GGTGGCCATA AGACATGGCA CACAATTCCC AATTTCACAA ATTCCTCATG 2220 
TCTTTGAGAT TTGATCAGTT TGTGAATATT TTATGCTTTG ATGATATAGT GAGAATGCAT 2280 
5 CACTTGCAAA AACGATCTCA AAAGTGTCAG CCTTAGATAA ACGTCAGCAT TAAAAAACGC 2340 
CAAAAAAAAA AAAAAAAAGC ATTTTAGGAT CCAGAAGAAT TCCACCAGAT TGCATGAGTT 2400 
AGATTGGGAA ATGGGAGTGG GAGATAATAT TGGGAGGTAT CTATTTTAAG TCAGGGGCTT 2460 
TACTAGCCGA TTTAGTTCTC ACAATAACCA TGTGGAGAAG CTGTGACATT TTTAATTTAC 2520 
AACCTTTCTG GGGCTCAGAC ATAAAGTTAC CTATCCAAGG TTGCAGTTGG GTAGTGGTGG 2580 

1 0 GACCAGGATG GACAACTCAT TGGCCCTGCC TCAAAAGCCA TACCTCTTCT CCTGCTATGC 2640 
AGAATCTGTT TCTCCTGAAT CTCTGTGATG CTGGTGGGAA TTGTTTGCAT AGAGGAAGGA 2700 
CAATAACCCT GCCATCGTGA GTTAATGTCC GGGCTGGTCA CAGTGGTTCA TGCCTGTAAT 2760 
CCCAGCACTT TGGGAGTCCA AGGCAGGCAT ATCATTTGAG GTCAGGAGTT TAAGACCAGC 2820 
CTGGCTAACA TAGTGAGACC CTGTTTCTAC TAAAAATACA AAAATAAGCC AGGTGTGGTG 2880 

1 5 GTGCATGACT GTAATCCCAA CTACTCAGCA GGAGAAGCAC TTGAACCCAG GAGACGGAGG 2940 
CGGCAGTGAG CCAAGATTGT GCCACTGCAC TCCAGCCTGG GCGACAGAGT GAAACTACAT 3000 
CTCAGAAAAA AAAAAAAAAA AAAAAAAAAA AAGTTAATGT CCAAAAATGA CAGATTTACA 3060 
AGTGTAAGCT ATATGATTTC TTCAAAAAGC AAAAGCAATA TACCTAATTC ATTTGGATCA 3120 
AACTTACATA GGTCTCAGGT CCTGTAAGAA ACTTGCCTGT TCTAACTGTT GCTACCAGAT 3180 

1 20 TATATCTGGT GGTAATTGTT AATGTTTCAG CAGGGCTGGT CTCAGTCCTT TAAAATGGAA 3240 

AGCTTTATTT GGAGCCCAAC CCTATAAGAT GAAGAAATCC TATATAGTCT TATTCACCAA 3300 
C 1 TATATCCAAT ACACCCACAG CAATGGTACC TTTTTAAGAT CAGGATTTTA TTATGA ATTC 3360 

CTGTCACTTT CTGTTTTCCA TTTAAATTTC TATTTTACAA ATTTTTCAGG GAATCATATT 3420 
* =i CTTAACTTCA CTGAGAAAAA TGTGTTACTC TTTTGGACAA TTTATCTTAT TTCTATCATA 3480 

! U 5 TAAGATGTAT TTTTTTATTG TCCTTAAAAG AAGCTCTAGC ATGAAATTAA AGGAAAGGGA 3540 
-= ! AAGAAATAGA TCTGGTGCAC CCGAACATTA GGAGAAAATG AAAAATATAC AACCAACCGT 3600 

TCGTGAGTC A TC AAAAAGTC AAAGTCAGCC TGGCCAACAT GGCAAAACTC CGTCTCTGCA 3660 
fl i AAAAATACAA AAATGAGCCC GGTATGTTGG CATATGCCTG TAATCCCAGC TACTCGAGAG 3720 

I GCTGAGACAC GAAAATTGCT TGAACCTGGG AGGCGGAGGT TGCAATGAGC CGAGATCGCG 3780 
"ISO CTACTGCACT CCAGCCTGGG CAACAGAGAG AGACCTTGTC TCAAAAAACA ACAACAACAA 3840 

AAAGTCAAAG TCATAATAAG CAAATTATTG GCTTCTTTCT TCTAGACYAA AAGAAATTAA 3900 

AGAGATGAAA CAATCAATTG CAAGGGTCAA AACTAGATTG GATCTTGGTT TGAATGAAAA 3960 

2 AAAGCATAAA ATATTCTTGC AATAATTGTA AAAATTTGAA TGTGGACTAA GTCCTAGATT 4020 
r=A ATATTAAAAT ATTTTTAATT TTTTAAGCTT GACAAATGCA CTGATTGTTA TACTTTAAAT 4080 
fe 35 AACTAAAAAT CTGAGAATCC ACAGTGCTAC AGACAATAAA TGATAAAATG GGAAAAAAAA 4140 
M' AAAAAAAAAA A 

J-5 \ 

s £»: 

J£S Seq ID NO: 18 Protein sequence: 

30 Protein Accession #: NP 079335.1 

f? i 1 11 21 31 41 51 

3fe '' I I I I I I 

MSLDCTSHIA LGAASPAPEE TYDHLSEVPV TREQLNHYRN VAQNARSELA ATLVKFECAQ 60 

45 SELQDLRSKM LSKEVSCQEL KAEMESYKEN NARKSSLLTS LRDRVQELEE ESAALSTSKI 120 
RTEITAHAA1 KENQELKKKV VELNEKLQKC SKENEENKKQ VSKNCRKHEE FLTQLRDCLD 1 80 
PDERNDKASD EDLILKLRDL RKENEFVKGQ IVILEETINV HEMEAKASRE TLMRLASEVN 240 
REQKKAASCT EEKEKLNQDL LSAVEAKEAL EREVKIFQER LLAGQQVWDA SKQEVSLLKK 300 
SSSELEKSLK ASQDAVTTSQ SQYSSFREKI AALLRGRLSM TGSTEDTILE KIREMDSREE 360 

50 SRDRMVSQLE AQISELVEQL GKESGFHQKA LQRAQKAENM LETLQGQLTH LEAELVSGGV 420 
LRDNLNFEKQ KYLKFLDQLS QKMKLDQMAA ELGFDTRLDV VLARTEQLVR LESNAVIENK 480 
TIAHNLQRKL KTQKERLESK ELHMSLLRQK IAQLEEEKQA RTALVVERDN AHLTIRNLQK 540 
KVERLQKELN TCRDLHTELK AKLADTNELK DCTLEQTKAI EDLNKSRDQL EKMKEKAEKK 600 
LMSVKSELDT TEHEAKENKE RARNMIEVVT SEMKTLKKSL EEAEKREKQL ADFREVVSQM 660 

5 5 LGLNVTSLAL PDYEIIKCLE RLVHSHQHHF VTCACLKDVT TGQERHPQGH LQLLH 

Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession ft: AF071552, NM_000662 

60 Coding sequence: 441-1313 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

65 CTTTGTATAA GGCTCAGCTA AAAGGGAAAT TGAGTGGGTC AGGTACCACG GATACTATAC 60 
ACTCTATTGC ATGATTCTCC TGCCTACATC AGAAGACGTT TATAAGCCTA TTTTAAAGGA 120 
TACCAGTTGG AATCTCTCTT TTATTAATCA CCAAGAGAAC CATGAACAAG CTGTTTATCA 1 80 
TTTGACTCAT CATTTAATCT TGATTTCCAG CTTCTCACAC TTGAAAGAAG ACATAATACA 240 
TTTCTCACAG GATTCTGGGA CTATTAACTG AACTTATGTG TGTAAAAGGA ATTCATACAA 300 

70 TGAAAGCACT AGAAATAATT ATTATACTTA TAACCATTGT ATTTTTACAT GTTTAAAATA 360 
TAGCCATAAT TAGCCTACTC AAATCCAAGT GTAAAAGTAA AATGATTTGC TTTCGTTTTG 420 
TTTTCCTTGC TTAGGGGATC ATG GACATTG AAGCATATCT TGAAAGAATT GGCTATAAGA 480 
AGTCTAGGAA CAAATTGGAC TTGGAAACAT TAACTGACAT TCTTCAACAC CAGATCCGAG 540 
CTGTTCCCTT TGAGAACCTT AACATCCATT GTGGGGATGC CATGGACTTA GGCTTAGAGG 600 

75 CCATTTTTGA TCAAGTTGTG AGAAGAAATC GGGGTGGATG GTGTCTCCAG GTCAATCATC 660 
TTCTGTACTG GGCTCTGACC ACTATTGGTT TTGAGACCAC GATGTTGGGA GGGTATGTTT 720 
ACAGCACTCC AGCCAAAAAA TACAGCACTG GCATGATTCA CCTTCTCCTG CAGGTGACCA 780 
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TTGATGGCAG GAACTACATT GTCGATGCTG GGTTTGGACG CTCATACCAG ATGTGGCAGC 840 
CTCTGGAGTT AATTTCTGGG AAGGATCAGC CTCAGGTGCC TTGTGTCTTC CGTTTGACGG 900 
AAGAGAATGG ATTCTGGTAT CTAGACCAAA TCAGAAGGGA ACAGTACATT CCAAATGAAG 960 
AATTTCTTCA TTCTGATCTC CTAGAAGACA GCAAATACCG AAAAATCTAC TCCTTTACTC 1020 
5 TTAAGCCTCG AACAATTGAA GATTTTGAGT CTATGAATAC ATACCTGCAG ACATCTCCAT 1080 
CATCTGTGTT TACTAGTAA A TCATTTTGTT CCTTGCAGAC CCCAG ATGGG GTTCACTGTT 1 140 
TGGTGGGCTT CACCCTCACC CATAGGAGAT TCAATTATAA GGACAATACA GATCTAATAG 1200 
AGTTCAAGAC TCTGAGTGAG GAAGAAATAG AAAAAGTGCT GAAAAATATA TTTAATATTT 1260 
CCTTGCAGAG AAAGCTTGTG CCCAAACATG GTGATAGATT TTTTACTATT TAGAATAAGG 1320 
1 0 AGTAAAACAA TCTTGTCTAT TTGTCATCCA GCTCACCAGT TATCAACTGA CGACCTATCA 1380 
TGTATCTTCT GTACCCTTAC CTTATTTTGA AGAAAATCCT AGACATCAAA TCATTTCACC 1440 
TATAAAAATG TCATCATATA TAATTAAACA GCTTTTTAAA GAAACATAAC CACAAACCTT 1500 
TTCAAATAAT AATAATAATA ATAATAATAA ATGTATTTTA AAGATGGCCT GTGGTTATCT 1560 
TGGAAATTGG TGATTTATGC TAGAAAGCTT TTAATGTTGG TTTATTGTTG AATTC 

Seq ID NO: 20 Protein sequence: 
Protein Accession #: NP_OO0653 . 1 



= 20 1 11 21 31 41 51 

I I I I I I 
Pi MDIEAYLERI GYKKSRNKLD LETLTDILQH QIRAVPFENL NIHCGDAMDL GLEAEFDQVV 60 

RRNRGGWCLQ VNHLLYWALT TIGFETTMLG GYVYSTPAKK YSTGMIHLLL QVTIDGRNYI 120 
VDAGFGRSYQ MWQPLELISG KDQPQVPCVF RLTEENGFWY LDQIRREQYI PNEEFLHSDL 180 
l!5 LEDSKYRKIY SFTLKPRTIE DFESMNTYLQ TSPSSVFTSK SFCSLQTPDG VHCLVGFTLT 240 
~-:J HRRFNYKDNT DLIEFKTLSE EEIEKVLKN1 FN1SLQRKLV PKHGDRFFTI 



3 1 11 21 31 41 51 

5 CGGCACG AGC AAAAAGGAAG AGTGGGAGGA GGAGGGG AAG CGGCGAAGG A GGAAGAGG AG 60 
M: GAGGAGGAAG AGGGGAGCAC AAAGGATCCA GGTCTCCCGA CGGGAGGTTA ATACCAAGAA 120 

~ CCATGTGTGC CGAGCGGCTG GGCCAGTTCA TGACCCTGGC TTTGGTGTTG GCCACCTTTG 180 

! k ; ACCCGGCGCG GGGGACCGAC GCCACCAACC CACCCGAGGG TCCCCAAGAC AGGAGCTCCC 240 

F ' AGCAG AAAGG CCGCCTGTCC CTGCAGAATA CAGCGGAGAT CCAGCACTGT TTGGTCAACG 300 

740 CTGGCGATGT GGGGTGTGGC GTGTTTGAAT GTTTCGAGAA CAACTCTTGT GAGATTCGGG 360 
© GCTTACATGG GATTTGCATG ACTTTTCTGC ACAACGCTGG AAAATTTGAT GCCCAGGGCA 420 

'?=-'■ AGTCATTCAT CAAAGACGCC TTGAAATGTA AGGCCCACGC TCTGCGGCAC AGGTTCGGCT 480 

\ fel GCATAAGCCG GAAGTGCCCG GCCATCAGGG AAATGGTGTC CCAGTTGCAG CGGGAATGCT 540 

ACCTCAAGCA CGACCTGTGC GCGGCTGCCC AGGAGAACAC CCGGGTGATA GTGGAGATGA 600 
45 TCCATTTCAA GGACTTGCTG CTGCACGAAC CCTACGTGGA CCTCGTGAAC TTGCTGCTGA 660 

CCTGTGGGGA GGAGGTGAAG GAGGCCATCA CCCACAGCGT GCAGGTTCAG TGTGAGCAGA 720 
ACTGGGGAAG CCTGTGCTCC ATCTTGAGCT TCTGCACCTC GGCCATCCAG AAGCCTCCCA 780 
CGGCGCCCCC CGAGCGCCAG CCCCAGGTGG ACAGAACCAA GCTCTCCAGG GCCCACCACG 840 
GGGAAGCAGG ACATCACCTC CCAGAGCCCA GCAGTAGGGA GACTGGCCGA GGTGCCAAGG 900 
50 GTGAGCGAGG TAGCAAGAGC CACCCAAACG CCCATGCCCG AGGCAGAGTC GGGGGCCTTG 960 
GGGCTCAGGG ACCTTCCGGA AGCAGCGAGT GGGAAGACGA ACAGTCTGAG TATTCTGATA 1020 
TCCGGAGGTGAAATGAAAGG CCTGGCCACG AAATCTTTCC TCCACGCCGT CCATTTTCTT 1080 
ATCTATGGAC ATTCCAAAAC ATTTACCATT AGAGAGGGGG GATGTCACAC GCAGGATTCT 1 140 
GTGGGGACTG TGGACTTCAT CGAGGTGTGT GTTCGCGGAA CGGACAGGTG AGATGGAGAC 1200 
5 5 CCCTGGGGCC GTGGGGTCTC AGGGGTGCCT GGTGAATTCT GCACTTACAC GTACTCAAGG 1 260 
GAGCGCGCCC GCGTTATCCT CGTACCTTTG TCTTCTTTCC ATCTGTGGAG TCAGTGGGTG 1320 
TCGGCCGCTC TGTTGTGGGG GAGGTGAACC AGGGAGGGGC AGGGCAAGGC AGGGCCCCCA 1380 
GAGCTGGGCC ACACAGTGGG TGCTGGGCCT CGCCCCGAAG CTTCTGGTGC AGCAGCCTCT 1440 
GGTGCTGTCT CCGCGGAAGT CAGGGCGGCT GGATTCCAGG ACAGGAGTGA ATGTAAAAAT 1500 
60 AAATATCGCT TAGAATGCAG GAGAAGGGTG GAGAGGAGGC AGGGGCCGAG GGGGTGCTTG 1560 
GTGCCAAACT GAAATTCAGT TTCTTGTGTG GGGCCTTGCG GTTCAGAGCT CTTGGCGAGG 1620 
GTGGAGGGAG GAGTGTCATT TCTATGTGTA ATTTCTGAGC CATTGTACTG TCTGGGCTGG 1680 
GGGGGACACT GTCCAAGGGA GTGGCCCCTA TGAGTTTATA TTTTAACCAC TGCTTCAAAT 1740 
CTCGATTTCA CTTTTTTTAT TTATCCAGTT ATATCTACAT ATCTGTCATC TAAATAAATG 1800 
65 GCTTTCAAAC AAAAAAAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 22 Protein sequence. 
Protein Accession #: NP O03705 



1 11 21 31 41 51 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60 
GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFKDALKCK AHALRHRFGC 120 
75 ISRKCPAIRE MVSQLQRECY LKHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 180 
CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 . 
EAGHHLPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300 




Seq ID NO: 21 DNA sequence 

Nucleic Acid Accession #■ NM_0037I4 

Coding sequence: 123-1031(underlined sequences correspond to start and stop codons) 



70 
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Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession #: NM 005264. 1 

Coding sequence: 557-1954(underlined sequences correspond to start and stop codons) 



10 



C20 



1 11 21 31 41 51 
I I I I I I 
GAATTCCGGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCCGCGCC GCTTCCAATA 60 
ACCACTAACA TCCCTAACGA GCATCCGAGC CGAGGGCTCT GCTCGGAAAT CGTCCTGGCC 1 20 
CAACTCGGCC CTTCGAGCTC TCGAAGATTA CCGCATCTAT TTTTTTTTTC TTTTTTTTCT 1 80 
TTTCCTAGCG CAGATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGCGGGGA CACCATTGCC 240 
1 5 CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 300 
TTGAGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 360 
ATCCCGGAGC TGAGTCGCCG GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCTTTCA 420 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 480 
CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGGATTT TTTTGGGGGG GCGGGGACCA 540 
GCCCCGCGCC GGCACC ATG T TCCTGGCGAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 600 
GCTCCTGTCG GCCGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAG CCAGTGATCA 660 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720 
CAAGG AGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCGCAGCGC 780 
CATGGAGGCC CTGAAGCAGA AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 840 
-25 GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 900 
GCTGGAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 1020 
: »; GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 1080 

CACCCCGTGC ACCACCAGCG TGTCCAACG A TGTCTGCAAC CGCCGCAAGT GCCACAAGGC 1140 
CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CA AGCACAGC TACGGAATGC TCTTCTGCTC 1 200 
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAGACC ATCGTGCCTG TGTGCTCCTA 1260 
TGAAGAGAGG GAGAAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CGAATTACAT 1320 
CTGCAGATCT CGCCTTGCGG ATTTTTTTAC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 1380 
CAGCTGTCTA AAGGAAAACT ACGCTGACTG CCTCCTCGCC TACTCGGGGC TTATTGGCAC 1440 
AGTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGCCCCAT GGTGTGACTG 1500 
CAGCAACAGT GGGAACGACC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 1560 
TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCGTGTGGCA 1620 
GCCAGCCTTC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 1680 
CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCGTG 1740 
TGCAAATTTA CAGGCACAGA AGCTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 1800 
TTCCAATGGT AATTATGAAA AAGAAGGTCT CGGTGCTTCC AGCCACATAA CCACAAAATC 1860 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 1920 
GTCCACCCTA TTATCTTTAA CAGAAACATC ATAG CTGCAT TAAAAAAATA CAATATGGAC 1980 
ATGTAAAAAG ACAAAAACCA AGTTATCTGT TTCCTGTTCT CTTGTATAGC TGAAATTCCA 2040 
GTTTAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGGA ACATTTTTTT TTTTCCTTTT 2100 
AAGAAAGCTT CTTGTGATCC TTCGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 2160 
AAACTCAGAA GGCTTTGGGA TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTGT 2220 
AAAGCAAACT GGGGCTGTGT TTTCGATGAT GATGATCATC ATGATCATGA TGATTTTAAC 2280 
AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 2340 
5 U TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 2400 
CTTTTTTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT TGTGGAAACA GCTGGTACTG 2460 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTGA TGTTTATGTA CTGTAAACAG 2520 
TTCTGCACTC TTGTACAAAA GAAAAAACCA CCCGGAATTC 

55 Seq ID NO: 24 Protein sequence: 

Protein Accession it: NP_005255 

1 11 21 31 41 51 
AA I I I i I I 

OU MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYRTLR QCVAGKETNF 60 

SLASGLEAKD ECRSAMEALK QKSLYNCRCK RGMKKEKNCL RIYWSMYQSL QGNDLLEDSP 120 
YEPVNSRLSD IFRVVPFISD VFQQVEHIPK GNNCLDAAKA CNLDDICKKY RSAYITPCTT 1 80 
SVSNDVCNRR KCHKALRQFF DKVPAKHSYG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 240 
PNCLNLQDSC KTNYICRSRL ADFFTNCQPE SRSVSSCLKE NYADCLLAYS GLIGTVMTPN 300 

65 YIDSSSLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNGSD VTVWQPAFPV 360 
QTTTATTTTA LRVKNKPLGP AGSENEIPTH VLPPCANLQA QKLKSNVSGN THLCISNGNY 420 
EKEGLGASSH ITTKSMAAPP SCGLSPLLVL VVTALSTLLS LTETS 

Seq ID NO: 25 DNA sequence 
70 Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1576(entire sequence represents open reading frame) 

1 II 21 31 41 51 

~r I I I I I I 

/D CTTTTGTTTC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAGAA ATGGGGGCTT 60 

TGGAG AACTA GGATGTTTCG GGGGAAGCGC TAAGGACCGA GGGCTGCTGG AAGACGAGCG 1 20 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CCCCCGCCCC 1 80 
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r : '40 



45 



CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 240 
CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 300 
CGCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 360 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420 
TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 480 
GGCCTTGAGG GCCAAGACCA ACACCTACAT CAAGACACCG GTGAGGGGCG AGGAACCAGT 540 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATCTCAGC 600 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 720 
GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 780 
CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCCCAGGCAA 840 
CGTGGAGCGT GCGCGCGAGG AGATCGAGAC GCACATCGCG GTGCGCACTG GCAAGATCCT 900 
CGAGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 
CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 1020 
GAACAGCCTG GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 1080 
GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1140 
CAAGCAGGAT GTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 
GAACGCCACG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1260 
CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA 1320 
GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 1380 
TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 
AGTGACTGCC GCCCTTGTGC CCTGCGG ACA CAACCTGTTC TGCATGG AGT GTGCAGTACG 1 500 
CATCTGCGAG AGGACGGACC CAGAGTGTCC CGTCTGCCAC ATCACAGCCA CGCAAGCCAT 1560 
CCGAATATTC TCCTAA 

Seq ID NO: 26 Protein sequence: 

Protein Accession #: FGENESH predicted 



1 11 21 31 41 51 

I I I I I I 

FCFAMPSLVV SGIMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLGLGEPPAP 60 
RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 120 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE IVGRQGCKJK A LRAKTNTYI KTPVRGEEPV 1 80 
FMVTGRREDV ATARRE1ISA AEHFSMIRAS RNKSGAAFGV APALPGQVTI RVRVPYRVVG 240 
LWGPKGATI KRIQQQTNTY IITPSRDRDP VFEITGAPGN VERAREEIET HIAVRTGKIL 300 
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLGCIG ECG VDSGFEAPRL 360 
GEQGGDFGYG GYLFPGYGVG KQDVYYGVAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 420 
KARAGPPGAH RSPATSAGPE LAGLPRRPPG EPLQGFSKLG GGGLRSPGGG RDCMVCFESE 480 
VTAALVPCGH NLFCMECAVR ICERTDPECP VCHITATQAI RIFS 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1-2070 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 
CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 
CTCAGGCAG A GCGACCCGCA GAAACGGAAC CTGG ACCTGG AGAAAAGCCT GCAGTTCCTG 1 80 
CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATGAGG AGATCGAGCA TCTGAAGCGG 240 
GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 
ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 
GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 
CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGGACCT GGACAGATGC CGCTACCTCT 480 
AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 
CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 
CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 
ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 
GCAGCAACCA TGGGGACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 
GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 
GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG GAGGACATCT GACTGGTGGA 960 
TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 
GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 
CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1 140 
GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 
CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 
GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 
AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 
GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 
TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 
TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560 
CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 
GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 
AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 
ACCACACTTA GGCAGTGCG A AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1 800 
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ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1 860 
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920 
AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 

Seq ID NO: 28 Protein sequence: 

Protein Accession #: FGENESH predicted 



1 11 21 31 41 51 

I I I I I I 

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 
QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120 
GGTQDGEPLQ TVLAHLAALA PVCQPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 1 80 
GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 
MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 
AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 
GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 
SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540 
EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELWNTNLLQ 600 
TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660 
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NM_012319.2 

Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 
CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120 
GCGGAGACGA AGGCGCA ATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 
TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 
AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 
ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480 
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 
GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 
TCTCTQAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780 
CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840 
TGGCTGGTAG G A A A ACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 
GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 
GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 
TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1 140 
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 
CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380 
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 
TTCTTGTTG A ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAA AGAATC 1 500 
AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 
GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1 680 
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800 
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860 
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 
TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 
GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 
ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 
GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 
GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 
TC TAGT TAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 
AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2520 
TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 
TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 
TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 
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TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 
AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 
CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 
GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 
TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 
TATTGCCAAG TTATATATCA CCA A A AG CTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 
CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 3300 
TCATTTGATT CGATTCAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 
GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT ATATCAGCTT G 

Seq ID NO: 30 Protein sequence: 
Protein Accession #: NP_03 645 1 .2 

1 11 21 31 41 51 

I I I I I I 

MARKLSVELI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60 
FYRYGENNSL SVEGFRKLLQ NIGIDKDCRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120 
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 1 80 
SVSASEVTST VYNT V SEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300 
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360 
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEKKQL SKYESQLSTN 480 
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600 
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660 
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720 
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF 

Seq ID NO: 31 DNA sequence 

Nucleic Acid Accession # : NM_002 184.1 

Coding sequence: 256-3012(underIined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GAGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 
CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120 
AGACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTGAAACCC AATGGAAAAA GCATGACATT 180 
TAGAAGTAGA AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGATTTGG 240 
AAATATCCGC GCAAG ATGT T GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 
CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 
CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420 
GATTATTTTC ATGTAAATGC TAATTACATT GTCTGGAAAA CAAACCATTT TACTATTCCT 480 
AAGGAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 540 
TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTGA ACAGAATGTT 600 
TATGGAATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660 
GTGAACGAGG GGAAGAAAAT GAGGTGTGAG TGGGATGGTG GAAGGGAAAC ACACTTGGAG 720 
ACAAACTTCA CTTTAAAATC TGAATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 780 
CGTGACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 
GTCTGGGTAG AAGCAGAGAA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTGAT 900 
CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGGAA 960 
CTGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 
AAATATAACA TTCAATATAG GACCAAAGAT GCCTCAACTT GGAGCCAGAT TCCTCCTGAA 1080 
GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1 140 
GTGTTTAGGA TTCGCTGTAT GAAGGAAGAT GGTAAGGGAT ACTGGAGTGA CTGGAGTGAA 1200 
GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 
ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 
CCTTTTGAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CTCTCACAAG ATGGAAATCA 1380 
CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1440 
TATCTAGCAA CCCTA ACAGT AAGAAATCTT GTTGGCAAAT CAG ATGCAGC TGTTTTAACT 1 500 
ATCCCTGCCT GTGACTTTCA AGCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1560 
GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 
GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1680 
ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 
GTTACTCCAG TATATGCTGA TGGACCAGGA AGCCCTGAAT CCATAAAGGC ATACCTTAAA 1800 
CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACAAAAA AAGTAGGGAA AAACGAAGCT 1860 
GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1920 
ATATTTTATA GAACCATCAT TGGAAATGAA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980 
GAATATACAT TGTCCTCTTT GACTAGTGAC ACATTGTACA TGGTACGAAT GGCAGCATAC 2040 
ACAGATGAAG GTGGGAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 
GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 
GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTAATTAAAA AACACATCTG GCCTAATGTT 2220 
CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280 
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TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 
ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 
AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 
TCTTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCGAGC 2520 
ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCCAA 2580 
GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCCAGAAGAT 2640 
CTACAATTAG TAGATCATGT AGATGGCGGT G ATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 
AAACAGAACT GCAGTCAGCA TGAATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 
CAAGTTTCAT CAGTCAATGA GGAAGATTTT GTTAGACTTA AACAGCAGAT TTCAGATCAT 2820 
ATTTCACAAT CCTGTGGATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 
GCTTTTGGTC CAGGTACTGA GGGACAAGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2940 
GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 
ATGCCTCAG T GA AGGACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTAAAATGAT TTTATCTGTG AATTC 

Seq ID NO: 32 Protein sequence: 
Protein Accession #: NP 002175.1 

1 11 21 31 41 51 

I I I I I I 

MLTLQTWVVQ ALFIFLTTES TGELLDPCGY ISPESPVVQL HSNFTAVCVL KEKCMDYFHV 60 
NANYIVWKTN HFTIPKEQYT IINRTASSVT FTDIASLN1Q LTCNILTFGQ LEQNVYGITI 120 
ISGLPPEKPK NLSCIVNEGK KMRCEWDGGR ETHLETNFTL KSEWATHKFA DCKAKRDTPT 1 80 
SCTVDYSTVY FVN1EVWVEA ENALGKVTSD HINFDPVYKV KPNPPHNLSV INSEELSSIL 240 
KLTWTNPSDC SVIILKYNIQ YRTKDASTWS QIPPEDTAST RSSFTVQDLK PFTEYVFRIR 300 
CMKEDGKGYW SDWSEEASGI TYEDRPSKAP SFWYKIDPSH TQGYRTVQLV WKTLPPFEAN 360 
GKILDYEVTL TRWKSHLQNY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLTIPACD 420 
FQATHPVMDL KAFPKDNMLW VEWTTPRESV KKYILEWCVL SDKAPCITDW QQEDGTVHRT 480 
YLRGNLAESK CYLITVTPVY ADGPGSPESI KAYLKQAPPS KGPTVRTKKV GKNEAVLEWD 540 
QLPVDVQNGF IRNYTIFYRT IIGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600 
KDGPEFTFTT PKFAQGEIEA IVVPVCLAFL LTTLLGVLFC FNKRDLKKH IWPNVPDPSK 660 
SHIAQWSPHT PPRHNFNSKD QMYSDGNFTD VSVVEIEAND KKPFPEDLKS LDLFKKEKIN 720 
TEGHSSGIGG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TVVHSGYRHQ VPSVQVFSRS 780 
ESTQPLLDSE ERPEDLQLVD HVDGGDGILP RQQYFKQNCS QHESSPDISH FERSKQVSSV 840 
NEEDFVRLKQ QISDHISQSC GSGQMKMFQE VSAADAFGPG TEGQVERFET VGMEAATDEG 900 
MPKSYLPQTV RQGGYMPQ 

Seq ID NO: 33 DNA sequence 

Nucleic Acid Accession #: NM_018255.1 

Coding sequence: 1 1-2491 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

AGTTGGCGAC ATG GTGGCAC CCGTGCTGGA GACTTCTCAC GTGTTTTGCT GCCCAAACCG 60 
GGTGCGGGG A GTCCTGAACT GG AGCTCTGG GCCCAGAGG A CTTCTGGCCT TTGGCACGTC 1 20 
CTGCTCCGTG GTGCTCTATG ACCCCCTGAA AAGGGTTGTT GTTACCAACT TGAATGGTCA 1 80 
CACCGCCCGA GTCAATTGCA TACAGTGGAT TTGTAAACAG GATGGCTCCC CTTCTACTGA 240 
ATTAGTTTCT GGAGGATCTG ATAATCAAGT GATTCACTGG GAAATAGAGG ATAATCAGCT 300 
TTTAAAAGCA GTGCATCTTC AAGGCCATGA AGGACCTGTT TATGCGGTGC ATGCTGTTTA 360 
CCAGAGGAGG ACATCAGATC CTGCATTATG TACACTGATC GTTTCTGCAG CTGCAGATTC 420 
TGCTGTTCGA CTCTGGTCTA AAAAGGGTCC AGAAGTAATG TGCCTTCAGA CTTTAAACTT 480 
TGGAAATGGA TTTGCTTTGG CTCTCTGCTT ATCTTTTTTG CCAAATACTG ATGTACCAAT 540 
ATTAGCATGT GGCAATGATG ATTGCAGAAT TCACATATTT GCTCAACAAA ATGATCAGTT 600 
TCAGAAAGTG CTTTCTCTCT GTGGACATGA GGATTGGATT AGAGGAGTGG AATGGGCAGC 660 
CTTTGGTAGA GATCTTTTCC TAGCAAGCTG TTCACAAGAT TGCCTGATAA GAATATGGAA 720 
GCTGTATATA AAGTCAACAT CTTTAGAAAC TCAGGATGAC GATAACATAA GACTGAAAGA 780 
AAATACTTTT ACCATAGAAA ATGAAAGTGT TAAAATAGCA TTTGCTGTTA CTCTGGAGAC 840 
AGTGCTAGCC GGTCATGAAA ACTGGGTAAA TGCAGTTCAC TGGCAACCTG TGTTTTACAA 900 
AGATGGTGTC CTACAGCAGC CAGTGAGATT ATTATCTGCT TCCATGGATA AAACCATGAT ' 960 
TCTCTGGGCT CCAGATGAAG AGTCAGGAGT TTGGCTAGAA CAGGTTCGAG TAGGTGAAGT 1020 
AGGTGGGAAT ACTTTGGGAT TTTATGATTG CCAGTTCAAT GAAGATGGCT CCATGATCAT 1080 
TGCTCATGCT TTCCACGGAG CGTTGCACCT TTGG AAACAG AATACAGTTA ACCCAAGAGA 1 140 
GTGGACTCCA GAGATTGTCA TTTCAGGACA CTTTGATGGT GTCCAAGACC TAGTCTGGGA 1200 
TCCAGAAGGA GAATTTATTA TCACTGTTGG TACTGATCAG ACAACTAGAC TTTTTGCTCC 1260 
ATGGAAGAGA AAAGACCAAT CACAGGTGAC TTGGCATGAA ATTGCAAGGC CTCAGATACA 1320 
TGGGTATGAC CTGAAATGTT TGGCAATGAT TAATCGGTTT CAGTTTGTAT CTGGAGCAGA 1380 
TGAAAAAGTT CTTCGGGTTT TTTCTGCACC TCGGAATTTT GTGGAAAATT TTTGTGCCAT 1440 
TACAGGACAA TCACTGAATC ATGTGCTCTG TAATCAAGAT AGTGATCTTC CAGAAGGAGC 1500 
CACTGTCCCT GCATTGGGAT TATCAAATAA AGCTGTCTTT CAGGGAGATA TAGCTTCTCA 1560 
GCCTTCTGAT GAAGAGGAGC TGTTAACTAG TACTGGTTTT GAGTATCAGC AGGTGGCCTT 1620 
TCAGCCCTCC ATACTTACTG AGCCTCCCAC TGAGGATCAT CTTCTGCAGA ATACTTTGTG 1680 
GCCTGAAGTT CA AAAACTAT ATGGGCACGG TTATGAAATA TTTTGTGTTA CTTGTAACAG 1 740 
TTCAAAGACT CTGCTTGCCT CAGCTTGTAA GGCAGCTAAG AAAGAGCATG CAGCTATCAT 1800 
TCTTTGGAAC ACTACATCTT GGAAACAGGT GCAGAATTTA GTTTTCCACA GTTTGACAGT 1860 
CACGCAGATG GCCTTCTCAC CTAATGAGAA GTTCTTACTA GCTGTTTCCA GAGATCGAAC 1920 
CTGGTCATTG TGGAAAAAGC AGGATACAAT CTCACCTGAG TTCGAGCCAG TTTTTAGTCT 1980 
TTTTGCCTTC ACCAACAAAA TTACTTCTGT GCACAGTAGA ATTATTTGGT CTTGTGATTG 2040 
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GAGTCCTGAC AGCAAGTATT TCTTCACTGG GAGTCGAGAC AAAAAGGTGG TTGTCTGGGG 2100 
TGAGTGCGAC TCCACTGATG ACTGTATTGA GCACAACATT GGCCCCTGCT CCTCAGTCCT 2160 
GGACGTGGGT GGGGCTGTGA CAGCTGTCAG CGTCTGCCCA GTGCTCCACC CTTCTCAACG 2220 
ATACGTGGTT GCAGTAGGAT TGGAGTGTGG AAAGATTTGC TTATATACCT GGAAAAAGAC 2280 
5 TGATCAAGTT CCAGAAATAA ATGACTGGAC CCACTGTGTA GAAACAAGTC AAAGCCAAAG 2340 
TCATACACTG GCTATCAGAA AATTATGCTG GAAGAATTGC AGTGGAAAAA CTGAACAGAA 2400 
GGAAGCAGAA GGTGCTGAGT GGTTACACTT TGCAAGCTGT GGTGAAGATC ACACTGTGAA 2460 
GATACACAGA GTCAATAAAT GTG CACTG T A AT GG 

10 Seq ID NO: 34 Protein sequence: 

Protein Accession #: NP 060725.1 

1 11 21 31 41 51 

1 5 MVAPVLETSH VFCCPNRVRG VLNWSSGPRG LLAFGTSCSV VLYDPLKRVV VTNLNGHTAR 60 

VNCIQWICKQ DGSPSTELVS GGSDNQVIHW EIEDNQLLKA VHLQGHEGPV YAVHAVYQRR 120 
TSDPALCTLI VSAAADSAVR LWSKKGPEVM CLQTLNFGNG FALALCLSFL PNTDVPILAC 1 80 
GNDDCR1HIF AQQNDQFQKV LSLCGHEDWI RGVEWAAFGR DLFLASCSQD CLIRIWKLYI 240 
f"=: KSTSLETQDD DNIRLKENTF TIENESVK1A FAVTLETVLA GHENWVNAVH WQPVFYKDGV 300 

i£0 LQQPVRLLSA SMDKTMILWA PDEESGVWLE QVRVGEVGGN TLGFYDCQFN EDGSMIIAHA 360 

U FHGALHLWKQ NTVNPREWTP EIVISGHFDG VQDLVWDPEG EFIITVGTDQ TTRLFAPWKR 420 

I r! KDQSQVTWHE IARPQIHGYD LKCLAMINRF QFVSGADEKV LRVFSAPRNF VENFCAITGQ 480 

= ~ ? ; SLNHVLCNQD SDLPEGATVP ALGLSNKAVF QGDIASQPSD EEELLTSTGF EYQQVAFQPS 540 

El ILTEPPTEDH LLQNTLWPEV QKLYGHGYEI FCVTCNSSKT LLASACKAAK KEHAAIILWN 600 

=2i5 TTSWKQVQNL VFHSLTVTQM AFSPNEKFLL AVSRDRTWSL WKKQDTISPE FEPVFSLFAF 660 

I W' TNKITSVHSR IIWSCDWSPD SKYFFTGSRD KKWVWGECD STDDCIEHNI GPCSSVLDVG 720 

W GAVTAVSVCP VLHPSQRYVV AVGLECGKIC LYTWKKTDQV PEINDWTHCV ETSQSQSHTL 780 

==; : AIRKLCWKNC SGKTEQKEAE G AEWLHFASC GEDHTVKIHR VNKCAL 

=30 Seq ID NO: 35 DNA sequence 

~~. Nucleic Acid Accession #: NM_022131 

lissi Coding sequence: 1 1-2878 (underlined sequences correspond to start and stop codons) 

EZ 1 11 21 31 41 51 

F35 I I I I I I 

!*' TGCTGCG AGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 

GGGGAGCGGC AGCGGCGGTG GCGGGGACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 120 
CI i CAATAAGCAC AAGCCATGGA TCGAGACTTC ATATCATGGA GTCATAACTG AGAACAATGA 1 80 

==f ; CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTTTTGC 240 

1 40 AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 
CAACAAGACA TCAGGAGAGG GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 
GAAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCCC ACGAGACAGC 420 
CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 
TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGACG GAGGGCAAGA TCTATGACAG 540 

45 CATTCTGCAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 
CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 
CACTGAGAAG CTGAGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGCAAGACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 

50 GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTCCAGAT 900 
CGTCACAGAG CTGCAGACTA ATTACATTGG GAAGGGTTGT GACCGGGAGA CCTACTCTGA 960 
GAAATCCCTT CAGAAGTTAT GTGGAGCCTC CTCTGGCATC ATTGACCTCT TGCCATCCCC 1020 
TAGCGCTGCC ACCAACTGGA CTGCAGGACT GCTGGTGGAC AGCAGTGAGA TGATCTTCAA 1080 
GTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 1 140 

5 5 TCAGTTCACC ATCACCATGT GGATGAAACA CGGCCCCAGC CCTGGTGTGA GAGCCGAGAA 1200 
GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 1260 
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTGACACCTT 1320 
TCGCCCCGCG GAGTTCCACT GGAAGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 1380 
TGTCATCAAT GTGGAGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 

60 ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1500 
CGGCGCTTGT TGGCAAGGAG GAGAAGTCAC CAAACCACAG TTTGCTCAGT TCTTTCATGG 1560 
AAGCCTGGCC AGTCTCACCA TCCGCCCTGG CAAAATGGAA AGCCAGAAGG TGATCTCCTG 1620 
CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 
AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTGATG GAAGGTGACG ACATTGGGAA 1740 

65 CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1800 
TGTGCGGCGC CTCAAAGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAGACG TATGCATCAG 1860 
TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTCCAGGCC ATCGAGCCCC GGATCACCCT 1920 
CCGGGGCACA GACCACTTCT GGAGACCTGC TGCCCAGTTT GAAAGTGCCA GGGGAGTGAC 1980 
CCTCTTCCCT GATATCAAGA TTGTGAGCAC CTTCGCCAAA ACCGAAGCCC CCGGGGACGT 2040 

70 GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 21 00 
TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160 
CAGTGAGCTC CACCAACGAC ACCTGGATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220 
CGGTGTGGGC TCCATGAGCC GCTATGAGCA GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 
GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 

75 CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATGAAGACC AAGTCTCAGA 2400 
TAAGGAGCAT GTCAATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 
TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
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CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580 
CGCCCACCAG CACTTCATCC AGGAGACTGA GGCTGCCAAG GAATCTGAGA TGGACTGGGA 2640 
CGATTCTGCG CTGACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 2700 
GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGAAGCCGAG GAAGAAATGA GCTCCAGCAG 2760 
TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2S20 
GCAGAATGGA GCCAGGCAAG CCCAGCTGGA GTGGGATGAC TCCACCCTCC CCTAC TAGT G 2880 
CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 
CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGGAAGGCC TTCTCCAGCT 3000 
TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 
AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTGTCCT GTAGCCTCCA 3120 
CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CTTTGTTTTT 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 
GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 
CTGACTCCAG GTTGCTTCAT ACAAGGAGGG TGGTTGAACT TCACACACGT AAGGTCTTAG 3360 
TGCTTAACAG TTTAAAGGAA AGTCCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 
CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTTT 3480 
CCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 3540 
TCCTTGGCCA CAAGCAGGGT CTGATCCCCC ATCAGAGCTA TCTGAGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 
CCAGGAGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 
GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 
AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840 
GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAGAAAATAC ACGCTTTCTG CATGTATTAG 3960 
AAACGCACGA GCTCCACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGGAATTAG 4020 
AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCTGC TCATAAAAGG AGGAACACTT 4080 
TAGATAGAGG GCAAATATAT CTGAAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4140 
TTTCCATCTC CATCCTAACA TGCACAACCT GTGAAGAGAA TTGTTTCTAT AGTAACTGGT 4200 
CTGTGATCTT TTGTGGCCAA GAGAATAGCA GGCAAGAATT AG GGCCTTGA CAGAATTTCC 4260 
ACGAAGCTCT GAGAACATGT TTGTTTCGAA TGTCTGATTC CTCTTTGTCA TCAATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AGAGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAAAAAA AATAGTTCAG AATCTCAGCC 4440 
TTTTCTTTGT GTCATCAAAA CAGCTTAAGA AGGGGACTAC TGCCAATGTC CTCTAGTCTG 4500 
ACCTCCACCC AGGGAGGACC CATGGCAGGT CTTTTCAACT TTCTGATTCA TGAGAACAAC 4560 
CTTGTGAAGC TTTTCCCACC TCCTAAAGTG TTTTCTGCAT CTGTTCCTTC CTTTGGACCT 4620 
CACAACAAAT CCTGTGAAGT AACTGAGACA TCTGTTGTTA GATACATTTT TGTGATGAGT 4680 
AAACTGAGGC TTCG 

Seq ID NO: 36 Protein sequence: 
Protein Accession #: NP071414.1 

1 11 21 31 41 51 

I I I I I I 

MLPGRLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKVNKH KPWIETSYHG VITENNDTVI 60 
LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEAVVLNKT SGEGRLRAKS PIDCELQK.EY 120 
TFIIQAYDCG AGPHETAWKK. SHKAVVfflQV KDVNEFAPTT KEPAYKAVVT EGKIYDSILQ 1 80 
VEA1DEDCSP QYSQICNYEI VTTDVPFAID RNGNIRNTEK LSYDKQHQYE ILVTAYDCGQ 240 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLF PSIHLETCDG AVSSLQIVTE 300 
LQTNYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360 
RQGAKIPDGI VPKNLTDQFT ITMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHYALYVHN 420 
CRLVFLLRKD FDQADTFRPA EFHWKLDQIC DKEWHYYVIN VEFPVVTLYM DGATYEPYLV 480 
TNDWPIHPSH IAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTIRPGKME SQKVISCLQA 540 
CKEGLDINSL ESLGQGIKYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 600 
LKVSSKVQCF GEDVCISIPE VDAYVMVLQA IEPRITLRGT DHFWRPAAQF ESARGVTLFP 660 
DIKIVSTFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL VIGGDLDPRQ ECLELNHSEL 720 
HQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRIK CSELNGRYTS 780 
NEFNLEVS1L HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSVVP SIATWIHS 840 
VCMLVFVVAM GVYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTITVNPMEK HEGPGHGEDE 900 
TEGEEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDD STLPY 

Seq ID NO: 37 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 143-874 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 
CGCTGCCCGG CAGCCGGGAG CC ATGC GACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 1 80 
CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 
CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 
TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 
ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 
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AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720 
CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840 
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAA TGCTT TAATTTTCAT TTGCTACCTC 900 
TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 
CATCTG A ATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1140 
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200 
TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 
CAACCTTAAA AAAAAAAAAA AAAA 

Seq ID NO: 38 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 

I I I I I I 

MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCLQGPA 60 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120 
GKIAECTFTK MRSNS ALRVL FSGSLRLKCR NACCQRWYFT FNG AECSGPL PIEAIIYLDQ 1 80 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240 
LPK 

Seq ID NO: 39 DNA sequence 

Nucleic Acid Accession ft: NM_000949 

Coding sequence: 285-2153(underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 
CTCCCTCTTT CTGG ATTTTA CCGACCGTTC GCGAAACAGC TTTCCACACA ATGGAGCTTC 1 20 
ATGTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGC AG ACTTTGCTCC CTGACAAAAC 1 80 
TAAAGAACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 
GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATGAAG GAAAATGTGG 300 
CATCTGCAAC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 
TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGGACA GATGGAGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 
GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 
CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 
AGCCAGACCC TCCTTTGGAG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 
ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGGTTCACGC 780 
TCCTGTATGA AATTCGATTA AAACCCGAGA AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 
GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 
TTCGCTGCAA ACCAGACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TTCATTC AGA 960 
TACCTAGTGA CTTCACCATG AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 
CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 
GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGGATT TGATGCTCAT CTGTTGGAGA 1140 
AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 
ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1260 
TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 
ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380 
AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 1440 
CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 
TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1560 
CCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 
CACCGGCCAC TCTGTTGAAT GAAGCAGGTA AAGATGCTTT AAAATCCTCT CAAACCATTA 1680 
AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAGA 1740 
CTGACCAGGA TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTGGC TCCGCTAAAC 1800 
CCTTGGATTA TGTGGAGATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA 1860 
AACAGAGAGA GAACAGCGGC AAGCCCAAGA AGCCCGGGAC TCCTGAGAAC AATAAGGAGT 1920 
ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 
CTAAAAACGT GGCTTGCTTT GAAGAATCAG CCAAAGAGGC CCCACCATCA CTTGAACAGA 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 
TGGGTGGTTT GGATTACCTG GATCCCGCAT GTTTTACACA CTCCTTTCAC TGAT AGCTTG 2160 
ACTAATGGAA TGATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 
AATGCTCAAG AATGTAGTCA GACTGACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TTTTTAACCA CTTGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 
AACTGTGATT TGTAGATTTA CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATGAAAT 2400 
AAAAGCACAC TGCTTAGTAT TCTTGAGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 
GCTTTCATGA TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 
AAGATGACAA AAGAAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 
AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGGA TTACCCAAAT ATAACCATTT 2640 
AAAGAAAGAA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTGATAT GCAAGTAAGA AAT 

Seq ID NO: 40 Protein sequence: 
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Protein Accession #: NP_000940. 1 

1 11 21 31 41 51 

, I I I I I I 

5 MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 

SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120 
VTYIVQPDPP LELAVEVKQP EDRKPYLWIK WSPPTLIDLK TGWFTLLYEI RLKPEKAAEW 1 80 
EIHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVWIS 240 
VAVLSAVICL IIVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGKSE ELLSALGCQD 300 

10 FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 
SEKCEEPQAN PSTFYDPEVI EKPENPETTH TWDPQCISME GKIPYFHAGG SKCSTWPLPQ 420 
PSQHNPRSSY HNITDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480 
SFHSETDQDT PWLLPQEKTP FGSAKPLDYV E1HKVNKDGA LSLLPKQREN SGKPKKPGTP 540 
ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 600 

1 5 KCRLQLGGLD YLDPACFTHS FH 

Seq ID NO: 41 DNA sequence 

5 : Nucleic Acid Accession #: none found, Eos cloned sequence 

= = ' Coding sequence: 1-1572 (underlined sequences correspond to start and stop codons) 

£|0 

i n 21 31 41 51 

6 I I I I I I 

5 r-- • ATGACCCAAA ATAAATTAAA GCTTTGTTCC AAAGCCAATG TGTATACTGA AGTGCCTG AT 60 

i GGAGGATGGG GCTGGGCGGT AGCTGTTTCA TTTTTCTTCG TTGAAGTCTT CACCTACGGC 1 20 

025 ATC ATC A AG A CATTTGGTGT CTTCTTTAAT GACTTAATGG ACAGTTTTAA TGAATCCAAT 180 
m ■■ AGCAGGATCT CATGGATAAT CTCAATCTGT GTGTTTGTCT TAACATTTTC AGCTCCCCTC 240 

> ? = : GCCACAGTCC TGAGCAATCG TTTCGGACAC CGTCTGGTAG TG ATGTTGGG GGGGCTACTT 300 

\ GTCAGCACCG GGATGGTGGC CGCCTCCTTC TCACAAGAGG TTTCTCATAT GTACGTCGCC 360 

ATCGGCATCA TCTCTGGTCT GGGATACTGC TTTAGTTTTC TCCCAACTGT AACCATCCTA 420 
fc30 TCACAATATT TTGGCAAAAG ACGTTCCATA GTCACTGCAG TTGCTTCCAC AGGAGAATGT 480 
TTCGCTGTGT TTGCTTTCGC ACCAGCAATC ATGGCTCTGA AGGAGCGCAT TGGCTGGAGA 540 
1- TACAGCCTCC TCTTCGTGGG CCTACTACAG TTAAACATTG TCATCTTCGG AGCACTGCTC 600 

IsJ AGACCCATCT TTATCAGAGG ACCAGCGTCA CCGAAAATAG TCATCCAGGA AAATCGGAAA 660 

L= GAAGCGCAGT ATATGCTTGA AAATGAGAAA ACACGAACCT CAATAGACTC CATTGACTCA 720 

[3 5 GGAGTAG AAC TAACTACCTC ACCTAAAAAT GTGCCTACTC ACACTAACCT GGAACTGGAG 780 
HP CCGAAGGCCG ACATGCAGCA GGTCCTGGTG AAGACCAGCC CCAGGCCAAG CGAAAAGAAA 840 

F GCCCCGCTAT TAGACTTCTC CATTTTGAAA GAGAAAAGTT TTATTTGTTA TGCATTATTT 900 

GGTCTCTTTG CAACACTGGG ATTCTTTGCA CCTTCCTTGT ACATCATTCC TCTGGGCATT 960 
Cli AGTCTGGGCA TTGACCAGGA CCGCGCTGCT TTTTTATTAT CTACGATGGC CATTGCAGAA 1 020 

=-40 GTTTTCGGAA GGATCGGAGC TGGTTTTGTC CTCAACAGGG AGCCCATTCG TAAGATTTAC 1080 
"=' : ATTGAGCTCA TCTGCGTCAT CTTATTGACT GTGTCTCTGT TTGCCTTTAC TTTTGCTACT 1 140 

GAATTCTGGG GTCTAATGTC ATGCAGCATA TTTTTTGGGT TTATGGTTGG AACAATAGGA 1200 
GGGACTCACA TTCCACTGCT TGCTGAGGAT GATGTCGTGG GCATTGAGAA GATGTCTTCT 1260 
GCAGCTGGGG TCTACATCTT CATTCAGAGC ATAGCAGGAC TGGCTGGACC GCCCCTTGCA 1320 
45 GGTTTGTTGG TGGACCAAAG TAAGATCTAC AGCAGGGCCT TCTACTCCTG CGCAGCTGGC 1 380 
ATGGCCCTGG CTGCTGTGTG CCTCGCCCTG GTGAGACCGT GTAAGATGGG ACTGTGCCAG 1440 
CATCATCACT CAGGTGAAAC AAAGGTAGTG AGCCATCGTG GGAAGACTTT ACAGGACATA 1500 
CCTGAAGACT TTCTGGAAAT GGATCTTGCA AAAAATGAGC ACAGAGTTCA CGTGCAAATG 1560 
GAGCCGGTA T GA 

50 — 

Seq ID NO: 42 Protein sequence: 

Protein Accession # none found, Eos cloned sequence 

1 11 21 31 41 51 

I I I I I I 

MTQNKLKLCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG IIKTFGVFFN DLMDSFNESN 60 
SRISWIISIC VFVLTFSAPL ATVLSNRFGH RLWMLGGLL VSTGMVAASF SQEVSHMYVA 120 
IGIISGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC FAVFAFAPAI MALKERIGWR 1 80 
YSLLFVGLLQ LNIVIFGALL RPIFIRGPAS PKIVIQENRK EAQYMLENEK TRTSIDSIDS 240 
GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300 
GLFATLGFFA PSLYItPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREPIRKiY 360 
IELICVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GTHIPLLAED DVVGIEKMSS 420 
AAGVYIFIQS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG-MALAAVCLAL VRPCKMGLCQ 480 
HHHSGETKVV SHRGKTLQDI PEDFLEMDLA KNEHRVHVQM EPV 

Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence- 1 -1 749 (underlined sequences correspond to start and stop codons) 

70 1 11 21 31 41 51 

I I I I I I 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60 
GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 
GACCGGGAGA GGGAGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 1 80 
75 GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGGACCCT 300 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 
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TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 
CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 
CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 
GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 
TCCCCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1 140 
CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 
ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 
CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 
GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 
AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCCCCCTT CCCTCACACC AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 
ACTGTA TAA C CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1800 
GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATG 1860 
GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 
CATCCAGGGT ATCATTGATT TATGATGG AA AACCGGCCTC AGCTGGAGAT GACTGTGATG 1980 
TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 
AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 
TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG GACTTTTTTT 2160 
TTTTTTTTTT TTTTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 
TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 
AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 
TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 
TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCCCCAGCC 2460 
TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 
ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 
GGCACACCTT AATTTCATTG TAAAAAGATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 
GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 
TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 
TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 
TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 
TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 
TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 
GAGTGCAAGT AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCCTA GATCTGATAA 3060 
AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 
TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 
GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 
GATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 
ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 
CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 
ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 
GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 
TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 
TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 44 Protein sequence: 

Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I 1 I I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 
GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 1 20 
SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLG VAAEGSG 1 80 
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRWP CGALAARPSP HPGTPLRSCS 240 
CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 420 
LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 
GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 

Seq ID NO: 45 DNA sequence 

Nucleic Acid Accession #: NM_O02285 

Coding sequence: 55-3738 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 51 

1 I I I I I 

GGCCGAGCCT CGGCGGCGGC GGTAGCGGCG GCGGCGACGC TGACACCTCC CACC ATG GAC 60 
AGCTTCGACT TAGCCCTGCT CCAGGAATGG GACCTCGAGT CACTGTGTGT CTATGAACCA 120 
5 GATAG AAATG CATTACGG AG GAAAG AACGA GAAAGAAG AA ATCAAGAAAC TCAACAGGAT 1 80 
GATGGCACGT TTAATTCTAG TTACTCTCTC TTCAGTGAGC CCTACAAGAC TAACAAGGGG 240 
GATGAACTCT CCAACCGGAT CCAGAACACT TTAGGCAATT ATGATGAAAT GAAAGACTTT 300 
TTAACTGATA GAACCAATCA GAGTCATCTC GTTGGAGTTC CCAAACCTGG GGTTCCTCAG 360 
ACTCCTGTGA ACAAGATCGA TGAACATTTT GTTGCAGATT CAAGAGCCCA GAACCAGCCC 420 
1 0 TCGTCTATCT GTAGCACTAC AACTTCCACA CCAGCAGCTG TCCCCGTGCA GCAGAGTAAA 480 
AGAGGCACTA TGGGCTGGCA GAAGGCTGGG CACCCACCCT CTGACGGCCA ACAGAGAGCA 540 
ACACAACAGG GCTCTCTCAG GACCTTGCTT GGAGATGGTG TTGGCAGACA GCAGCCTCGG 600 
GCCAAACAAG TGTGCAATGT GGAGGTGGGC CTTCAGACCC AGGAGAGGCC ACCTGCCATG 660 
GCGGCCAAGC ACAGCAGCAG CGGACACTGT GTTCAGAACT TTCCTCCATC CCTAGCTTCA 720 
1 5 AAACCCAGCC TGGTCCAGCA GAAACCGACC GCGTATGTGA GGCCAATGGA CGGCCAAGAT 780 

CAGGCCCCTG ATGAGTCTCC TAAGCTGAAG TCGTCTTCGG AAACCAGCGT GCACTGCACA 840 
TCATACAGGG GAGTCCCTGC CAGCAAGCCG GAGCCTGCCA GAGCCAAGGC CAAGCTCTCC 900 
M- AAGTTCAGCA TCCCCAAGCA GGGGGAGGAG AGTAGATCTG GAGAAACCAA CAGCTGTGTT 960 

«=: GAAGAAATAA TCCGGGAGAT GACCTGGCTT CCACCACTTT CTGCTATTCA AGCACCTGGC 1020 

130 AAAGTGGAAC CAACCAAATT TCCATTTCCA AATAAGGACT CTCAGCTTGT ATCCTCTGGA 1080 

F=i CACAATAATC CAAAGAAAGG TGATGCAGAG CCAGAGAGTC CAGACAATGG CACATCGAAT 1 140 

TZ'. ACATCAATGC TGGAAGATGA CCTTAAGCTA AGCAGTGATG AAGAGGAGAA TGAACAGCAG 1200 

ij i GCAGCTCAGA GAACGGCTCT CCGCGCTCTC TCTGACAGCG CCGTGGTCCA GCAGCCCAAC 1260 

pQ TGCAGAACCT CGGTGCCTTC CAGCAAGGGC AGCAGCAGCA GCAGCAGCAG CGGCACGAGC 1320 

ZZ5 AGCTCCTCCA GCGACTCAGA GAGCAGCTCC GGATCTGACT CGGAGACCGA GAGCAGCTCC 1380 
TV AGCGAGAGTG AGGGCAGCAA GCCCCCCCAC TTCTCCAGCC CCGAGGCTGA ACCGGCATCC 1440 

| TCTAACAAGT GGCAGCTGGA TAAATGGCTA AACAAAGTTA ATCCCCACAA GCCTCCTATT 1 500 

CTGATCCAAA ATGAAAGCCA CGGGTCAGAG AGCAATCAGT ACTACAACCC GGTGAAAGAG 1560 
O GACGTCCAGG ACTGTGGGAA AGTCCCCGAC GTTTGCCAGC CCAGCCTGAG AGAGAAGGAG 1620 

30 ATCAAGAGCA CTTGCAAGGA GGAGCAAAGG CCAAGGACAG CCAACAAGGC CCCTGGGAGT 1680 
AAAGGCGTGA AGCAG AAGTC CCCGCCCGCG GCCGTGGCCG TGGCGGTGAG CGCAGCCGCC 1 740 
Q CCGCCACCCG CAGTGCCCTG TGCGCCCGCG GAGAACGCGC CCGCGCCTGC CCGGAGGTCC 1 800 

5~ GCGGGCAAGA AGCCCACCAG GCGCACCG AG AGGACCTCAG CCGGGGACGG CGCCAACTGC 1 860 

CACCGGCCCG AGGAGCCCGC GGCCGCGGAC GCGCTGGGGA CGAGCGTGGT GGTCCCCCCG 1920 
|]35 GAGCCCACCA AAACCAGGCC CTGTGGCAAC AACAGAGCGA GCCACCGCAA GGAGCTGCGC 1980 

" TCCTCCGTGA CCTGCGAGAA GCGCCGCACG CGGGGGCTAA GCAGGATCGT CCCCAAATCC 2040 
=2=: AAGGAGTTCA TTGAGACAGA GTCGTCATCT TCATCCTCCT CCTCGGACTC CGACCTGGAG 2100 

r : i TCCGAGCAGG AGGAGTACCC TCTGTCCAAA GCACAGACCG TGGCTGCCTC TGCCTCCTCC 2160 

~: GGGAATGATC AGAGGCTG AA GG AGGCCGCT GCCAACGGGG GCAGTGGTCC TAGGGCCCCT 2220 
f 40 GTAGGCTCCA TCAACGCCAG GACCACCAGT GACATCGCCA AGGAGCTGGA GGAGCAGTTC 2280 
TACACACTGG TCCCCTTTGG CCGGAACGAA CTTCTCTCCC CTCTAAAGGA CAGTGATGAG 2340 
ATCAGGTCTC TCTGGGTCAA AATCGACCTG ACCCTCCTGT CCAGGATCCC AGAACACCTG 2400 
CCCCAGGAGC CAGGGGTATT GAGCGCCCCT GCCACCAAGG ACTCTGAGAG CGCACCGCCC 2460 
AGCCACACCT CGGACACACC TGCAGAAAAG GCTTTGCCAA AATCCAAGAG GAAACGCAAG 2520 
45 TGTGACAACG AAGACGACTA CAGGGAGATC AAGAAGTCCC AGGGAGAGAA AGACAGCTCT 2580 
TCAAGACTGG CCACCTCCAC CAGTAATACT TTGTCTGCAA ACCACTGCAA CATGAACATC 2640 
AACAGTGTGG CAATACCAAT AAATAAAAAT G AAA AAATG C TTCGGTCGCC CATCTCACCC 2700 
CTCTCTGATG CATCTAAACA CAAATACACC AGCGAGGACT TAACTTCTTC CAGCCGACCT 2760 
AATGGCAACA GTTTGTTTAC TTCAGCCTCT TCCAGCAAAA AGCCTAAGGC CGACAGCCAG 2820 
5 0 CTGCAGCCTC ACGGCGGAG A CCTCACG AAA GCAGCTCACA ACAATTCTGA A AACATTCCC 2880 
CTCCACAAGT CACGGCCGCA GACGAAGCCG TGGTCTCCAG GCTCCAACGG CCACAGGGAC 2940 
TGCAAGAGGC AGAAACTTGT CTTCGATGAT ATGCCTCGCA GTGCCGATTA TTTTATGCAA 3000 
GAAGCTAAAC GAATGAAGCA TAAAGCAGAT GCAATGGTGG AAAAGTTTGG AAAGGCTTTG 3060 
AACTATGCTG AAGCAGCATT GTCGTTTATC GAGTGTGGAA ATGCAATGGA ACAAGGCCCC 3120 
5 5 ATGGAATCCA AATCTCCTT A TTACCTGATG T ATTC AGAAA CAGTAGAGCT CATCAGGTAT 3 1 80 

GCTATGAGAC TAAAAACCCA CTCAGGCCCC AATGCCACAC CAGAAGACAA ACAACTGGCT 3240 
GCATTATGTT ACCGATGCCT GGCCCTCCTG TACTGGCGGA TGTTTCGACT CAAAAGGGAC 3300 
CACGCTGTAA AGTATTCAAA AGCACTAATC GACTATTTCA AGAACTCATC TAAAGCCGCC 3360 
CAAGCCCCAT CTCCGTGGGG GGCCAGTGGA AAGAGCACTG GAACCCCATC CCCCATTTCT 3420 
60 CCCAACCCCT TTCCCGGCAG CTCCGTGGGG TCTCAGGGCA GCCTCTCCAA CGCCAGCGCC 3480 
CTGTCCCCGT CGACCATCGT CAGCATCCCA CAGCGCATCC ACCAGATGGC GGCCAACCAC 3540 
GTCAGCATCA CCAACAGCAT CCTGCACAGC TACGACTACT GGGAGATGGC CGACAACCTG 3600 
GCCAAGGAAA ACCGAGAATT CTTCAACGAC CTGGATCTGC TCATGGGGCC GGTCACCCTG 3660 
CACAGCAGCA TGGAGCACCT GGTCCAGTAC TCCCAACAGG GCCTGCACTG GCTGCGGAAC 3720 
65 AGCGCCCACC TGTC ATAG GG ACCTCACCCT GGGGCCAGAG TGGGCTCTGG TCTCCACAGA 3780 
TGGCTCAACG TTTTTGGACA CTGTGCTACT GAAACTCCCA GCCACAGCAT TTATAGACTG 3840 
CGGTGAACAT TTCCTCA 

SeqIDNO: 46 Protein sequence: 
70 Protein Accession #: NP_002276 

1 11 21 31 41 51 

MDSFDLALLQ EWDLESLCVY EPDRNALRRK ERERRNQETQ QDDGTFNSSY SLFSEPYKTN 60 
75 KGDELSNRIQ NTLGNYDEMK DFLTDRTNQS HLVGVPKPGV PQTPVNKIDE HFVADSRAQN 120 
QPSSICSTTT STPA AVPVQQ SKRGTMGWQK AGHPPSDGQQ RATQQGSLRT LLGDGVGRQQ 1 80 
PRAKQVCNVE VGLQTQERPP AMAAKHSSSG HCVQNFPPSL ASKPSLVQQK PTAYVRPMDG 240 
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QDQAPDESPK LKSSSETSVH CTSYRGVPAS KPEPARAKAK LSKFSIPKQG EESRSGETNS 300 
CVEEIIREMT WLPPLSAIQA PGKVEPTKFP FPNKDSQLVS SGHNNPKKGD AEPESPDNGT 360 
SNTSMLEDDL KLSSDEEENE QQAAQRTALR ALSDSAVVQQ PNCRTSVPSS KGSSSSSSSG 420 
TSSSSSDSES SSGSDSETES SSSESEGSKP PHFSSPEAEP ASSNKWQLDK WLNKVNPHKP 480 
5 PILIQNESHG SESNQYYNPV KEDVQDCGKV PDVCQPSLRE KEKSTCKEE QRPRTANKAP 540 

GSKGVKQKSP PAAVAVAVSA AAPPPAVPCA PAENAPAPAR RSAGKKPTRR TERTSAGDGA 600 
NCHRPEEPAA ADALGTSVVV PPEPTKTRPC GNNRASHRKE LRSSVTCEKR RTRGLSRIVP 660 
KSKEFIETES SSSSSSSDSD LESEQEEYPL SKAQTVAASA SSGNDQRLKE AAANGGSGPR 720 
APVGSINART TSDIAKELEE QFYTLVPFGR NELLSPLKDS DEIRSLWVKI DLTLLSRIPE 780 

1 0 HLPQEPGVLS APATKDSESA PPSHTSDTPA EKALPKSKRK RKCDNEDDYR EDCKSQGEKD 840 
SSSRLATSTS NTLSANHCNM NINSVAIPIN KNEKMLRSPI SPLSDASKHK YTSEDLTSSS 900 
RPNGNSLFTS ASSSKKPKAD SQLQPHGGDL TKAAHNNSEN IPLHKSRPQT KPWSPGSNGH 960 
RDCKRQKLVF DDMPRSADYF MQEAKRMKHK ADAMVEKFGK ALNYAEAALS FIECGNAMEQ 1020 
GPMESKSPYY LMYSETVELI RYAMRLKTHS GPNATPEDKQ LAALCYRCLA LLYWRMFRLK 1080 

1 5 RDHAVKYSKA LIDYFKNSSK AAQAPSPWGA SGKSTGTTSP ISPNPFPGSS VGSQGSLSNA 1 140 
SALSPSTIVS 1PQRIHQMAA NHVSITNSIL HSYDYWEMAD NLAKENREFF NDLDLLMGPV 1200 
TLHSSMEHLV QYSQQGLHWL RNSAHLS 

gjjj SeqIDNO: 47 DNA sequence 

;20 Nucleic Acid Accession #: NM_033151 

L-l Coding sequence: 351-4499(underlined sequences correspond to start and stop codons) 

; , 

f£>_ 1 11 21 31 41 51 

$5 i i i i i i 

! % ' ACTGGGATAA AGCAAGAAGA CTGATTTTAT GAGCAGGGGT TTGATACATC AAAGG AGATT 60 

K ! GCCCAGGATC AAGGGTGCGG TGTTGGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 

_T ACTAAGTGAT TTGGGCCTGA GGCCTGAGAA GATGTTTAAA AAGAGGGATC AAGCACAGGC 180 

Eli TAAGGAGAGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 

3 0 GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 
CTTTTCATAT TCTCTGATTC TGGGAAACGA AGAATTGGCA GGAACTGAAA ATGACTAGGA 360 
£!!■: AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 

•"I GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT GGCCCCTGGA 480 

; GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 

fS 5 ATGATGCTGC CTTG AG AACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 
= ~ . AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 

+ 1 TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG TCAGTCCATG 720 

P ! j ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA GTCTCAAGGC 780 

!?! GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 

W0 TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 

TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC CATGGAGTGG 960 
GACTCTGCTT TGCCCTTTTT CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020 
GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC TTTGCCTTTG 1080 
AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCT 1 140 
45 TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 
CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA TACACTGCAT 1260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 
CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 
AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA TTTGCAAAAA 1440 
50 TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGGA GAAGTGCGGG CTTGTCCAGA 1500 
GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1560 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT CTCACGAATT 1680 
CCAAGTCTGC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GGAGAGCCCT GTTTTCTATG 1740 
55 TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 
AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 1860 
CTGAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GGAAGAAGGG AACAGCCTGG 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCG 1980 
GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 
60 TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 
TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGGA ACTTCTGCCC TTTGGAGACA 2220 
TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 
TGGCCCGCGC CGTCTATTCC GACCGTCAGA TCTACCTGCT GGACGACCCC CTGTCTGCTG 2340 
65 TGGACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT TAAGAAGACA CTCAGGGGGA 2400 
AGACGGTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA ATTTTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGGAAAATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGGAC ATGTTGCAGG 2580 
ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 
70 AAGAGTCTCT CAACGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 
TCTGGTGGCT GAGCTACTGG TTGGAGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2880 
ATGGAACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 
75 TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGGATTT 3000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CAAGCTCTTC AACAAGGTTT 3060 
TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCTTTTG AACTGCTTCG 3120 

368 



Attorney Docket No.:018S01-005210US 



CAGGGGACTT GG AACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3 1 80 
TGTCCTTAAT GGTGATCGCC GTCCTGTTGA TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAATGGGAGC CATAATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
5 ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAGAG GCTGACTGAT GCGCAGAATA ACTACCTGCT GTTGTTTCTA TCTTCCACAC 3480 
GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 
TCGTGGCTTT TGGCATTTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 
TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTGCCCGGAT TGGCTTGGAG ACAGAGGCAC 3660 
1 0 AGTTCACGGC TGTAGAGAGG ATACTGCAGT ACATGAAGAT GTGTGTCTCG GAAGCTCCTT 3720 
TACACATGGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780 
AGGATTATCA CATGAAATAC AGAGACAACA CACCCACCGT GCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGG AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGGAGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 
1 5 TGGACATTTG CAGCATCGGC CTGGAGGACT TGCGGTCCAA GCTCTCAGTG ATCCCTCAAG 4020 
ATCCAGTGCT GCTCTCAGGA ACCATCAGAT TCAACCTAGA TCCCTTTGAC CGTCACACTG 4080 
ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 
H : CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 

f: AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA GATCATCCTT ATCGATGAAG 4260 

*§.'0 CCACAGCCTC CATTGACATG GAGACAGACA CCCTGATCCA GCGCACAATC CGTGAAGCCT 4320 
C5 TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTGACC 4380 

i m ACATCCTGGT TATGGGCAAT GGGAAGGTGG TAGAATTTGA TCGGCCGGAG GTACTGCGGA 4440 

^ i AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGATAAG 4500 

Hi GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 

r-25 TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620 
* GTAAATGTAG GGGGGGTGGG GATTGCTG G A TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
V_ I TCTCAAG ACC TTAG AACCCC AGAACCATCT AAG ACATGGG ATTCAGTGAT C ATGTGGTTC 4740 

..- TCCTTTTAAC TTACATGCTG AATAATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTGAT 4800 

W CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAGGAAAAC 4860 

s 30 tc 

0 Seq ID NO: 48 Protein sequence: 

Us Protein Accession #: NP 149163.2 

r- 

fp 5 MTMCRTYWVPNSSGGLVNTRGIDIGDDMVSGLIYKTYTLQDGPWSQQERNPEAPGRAAVPPWGKYDAALRT 
»i M1PFRPKPRFPAPQPLDNAGLFSYLTVSWLTPLMIQSLRSRLDENTIPPLSVHDASDKNVQRLHRLWEEE 
VSRRGIEKASVLLVMLRFQRTRLIFDALLGICFCIASVLGPILIIPKILEYSEEQLGNVVHGVGLCFALF 
C'i LSECVKSLSFSSSWIINQRTAIRFRAAVSSFAFEKLIQFKSVIHITSGEAISFFTGDVNYLFEGVCYGPL 
STj VLITCASLVICSISSYFIIGYTAFIAILCYLLVFPLAVFMTRMAVKAQHHTSEVSDQRIRVTSEVLTCIK 

1 **0 LIKMYTWEKPFAKIIEDLRRKERKLLEKCGLVQSLTSITLFIEPTVATAVWVLIHTSLKLKLTASMAFSM 

LASLNLLRLSVFFVPIAVKGLTNSKSAVMRFKKFFLQESPVFYVQTLQDPSKALVFEEATLSWQQTCPGI 
VNGALELERNGHASEGMTRPRDALGPEEEGNSLGPELHKINLVVSKGMMLGVCGNTGSGKSSLLSAILEE 
MHLLEGSVGVQGSLAYVPQQAWIVSGNIRENILMGGAYDKARYLQVLHCCSLNRDLELLPFGDMTEIGER 
GLNLSGGQKQRISLARAVYSDRQIYLLDDPLSAVDAHVGKHIFEECIKKTLRGKTVVLVTHQLQYLEFCG 
45 QnLLENGKICENGTHSELMQKKGKYAQLIQKMHKEATSDMLQDTAKIAEKPKVESQALATSLEESLNGN 
AWEHQLTQEEEMEEGSLSWRVYHHYIQAAGGYMVSCIIFFFVVLIVFLTIFSFWWLSYWLEQGSGTNSS 
RESNGTMADLGNIADNPQLSFYQLVYGLNALLLICVGVCSSGIFTKVTRKASTALHNKLFNKVFRCPMSF 
FDTIPIGRLLNCFAGDLF^LDQLLPIFSEQFLVLSLMVIAVLLrVSVLSPYILLMGAIIMVICFlYYMMF 
KKAIGVFKRLENYSRSPLFSMLNSLQGLSSIHVYGKTEDFISQFKRLTDAQNNYLLLFLSSTRWMALRL 
5 0 EIMTNLVTLAVALFVAFGISSTPYSFKVMAVNIVLQLASSFQATARIGLETEAQFTAVERILQYMKMCVS 
EAPLHMEGTSCPQGWPQHGEIIFQDYHMKYRDNTPTVLHGINLTIRGHEVVGIVGRTGSGKSSLGMALFR 
LVEPMAGRILIDGVDICSIGLEDLRSKLSVIPQDPVLLSGT1RFNLDPFDRHTDQQIWDALERTFLTKAI 
SKFPKKLHTDVVENGGNFSVGERQLLCIARAVLRNSKIILIDEATASIDMETDTLIQRTIREAFQGCTVL 
VIAHRVTTVLNCDHILVMGNGKWEFDRPEVLRKKPGSLFAALMATATSSLR 



55 



Seq ID NO: 49 DNA sequence 

Nucleic Acid Accession #: NM_033419 

Coding sequence. 1 8-980 (underlined sequences correspond to start and stop codons) 



60 1 11 21 31 41 51 

I I I I I I 

CGAGCCAGGG AG AAAGG ATG GCCGGCCTGG CGGCGCGGTT GGTCCTGCTA GCTGGGGCAG 60 
CGGCGCTGGC GAGCGGCTCC CAGGGCGACC GTGAGCCGGT GTACCGCGAC TGCGTACTGC 120 
AGTGCGAAGA GCAGAACTGC TCTGGGGGCG CTCTG AATCA CTTCCGCTCC CGCCAGCCAA 1 80 

65 TCTACATGAG TCTAGCAGGC TGGACCTGTC GGGACG ACTG TAAGTATGAG TGTATGTGGG 240 
TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 
CCTTCTCCCG GTTCCTGTTC TTTCAAGAGC CGGCATCGGC CGTGGCCTCG TTTCTCAATG 360 
GCCTGGCCAG CCTGGTGATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 
TGTACCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAA TGCATGGTTC TGGTCCACAG 480 

70 TTTTCCACAC CAGGGACACT GACCTCACAG AGAAAATGGA CTACTTCTGT GCCTCCACTG 540 
TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGG GCTGCAG CACCCAGCTG 600 
TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 
GCCTCATCCG CTTCGACTAT GGCTACAACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 
ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CACGTGCGCA 780 

75 AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 
CACCGCTCTT CTGGGTCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 
ACGTCCTCTT TTTCAGCTTT CTGGAAGATG ACAGCCTGTA CCTGCTGAAG GAATCAGAGG 960 
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ACAAGTTCAA GCTGGACTGA AGACCTTGGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 1020 
CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 1080 
TGAACTTGGA CATGAAGGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1140 
CCTCACCAGC CTTGGAGTCT GTTCTAGGGA AGGCCTCCCA GCATCTGGGA CTCGAGAGTG 1200 
5 GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTGCT CTTAGCTCTA 1260 
CCGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 
GGCTGGGTCC TGAAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGGATA 1380 
CAGGGGGTCC CCTTCTGTTA CCACCCCCCA CCCTCCTCCA GGACACCACT AGGTGGTGCT 1440 
GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTGAGGG 1500 
1 0 ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1 560 
GGAGGCCTCA CCATACTCCC TTTCAGGGCC AGGGCTCCAG CAAGCCCAGG GCAAGGATCC 1620 
TGTGCTGCTG TCTGGTTGAG AGCCTGCCAC CGTGTGTCGG GAGTGTGGGC CAGGCTGAGT 1680 
GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAGCTCAGG CCTAGGTGCG 1740 
CAGTGTGGAG ACGGGTGTTG TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1 800 
1 5 GGTGGGTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCGC GTG CTGGTGG GCATGTGAGA 1 860 
TGAGTGACTG CCGGTGAATG TGTCCACAGT TGAGAGGTTG GAGCAGGATG AGGGAATCCT 1920 
GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGG CGCCAC CTGGGCGGAC 1980 
i- =. AGCCAGGAGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TCCCTGTCTG GTGCCCCTTT 2040 

!!' GCCCGCCTCC TGCAAACCTC ACAGGGTCCC CACACAACAG TGCCCTCCAG AAGCAGCCCC 2100 

120 TCGGAGGCAG AGGAAGGAAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC CTTTTCTCCT 2160 
=~ : : TGCCTTCGCA TGGCTGGCCT TCCCCTCCAA AACCTCCATT CCCCTGCTGC CAGCCCCTTT 2220 

= ; GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTGA GGGAGAAGGG GAGAAAGCTT 2280 

y"! ATGGCTGGGT CTGGTTTCTT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 

fjjjj GCAGGCAGGG GCCACACTAT GCCTGCGCCC TGGTAAAGGT GACCCCTGCC ATTTACCAGC 2400 

is 15 AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAGA AACTTTCCAT 2460 
fl ! CCCAAAGGCA GTCTCCGTGG TTGAAGCAGA CTGGATTTTT GCTCTGCCCC TGACCCCTTG 2520 

' " TCCCTCTTTG AGGGAGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGCCTG 2580 

CGCTAGCTTC TTTTGATACT GAAAACTTTT AAGGTGGGAG GGTGGCAAGG GATGTGCTTA 2640 
f" i ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAA AAAAAAA AA 

; 30 

= Seq ID NO: 50 Protein sequence: 

f ~: Protein Accession #: NP_219487.1 

M ! 1 11 21 31 41 51 

fPP I I I I I I 

= ;;; MKDVGPESCG QPTPCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60 

sfc AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLLQ DTTRWCWMLV 120 

PS-. LWPAKVHGDS PHGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQGSS KPRAR1LCCC 1 80 

*»■' LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 

?4;0 

Seq ID NO: 51 DNA sequence 

Nucleic Acid Accession #: XM_059098. 1 

Coding sequence: 178-618(underlined sequences correspond to start and stop codons) 

45 

1 11 21 31 41 51 

I I I I I I 

GATGTACACT CTGAAGTGAG CACATTCCTG TTGGCAGGAC ATGACACCTT GGCAGCAAGC 60 
ATCTCCTGG A TCCTTT ACTG CCTGGCTCTG AACCCTGAGC ATCAAG AGAG ATGCCGGGAG 1 20 

5 0 GAGGTCAGGG GCATCCTGGG GGATGGGTCT TCTATCACTT GGGACCAGCT GGGTGAGATG 1 80 
TCGTACACCA CAATGTGCAT CAAGGAGACG TGCCGATTGA TTCCTGCAGT CCCGTCCATT 240 
TCCAGAGATC TCAGCAAGCC ACTTACCTTC CCAGATGGAT GCACATTGCC TGCAGGGATC 300 
ACCGTGGTTC TTAGTATTTG GGGTCTTCAC CACAACCCTG CTGTCTGGAA AAACCCAAAG 360 
GTCTTTGACC CCTTGAGGTT CTCTCAGGAG AATTCTGATC AGAGACACCC CTATGCCTAC 420 

55 TTACCATTCT CAGCTGGATC AAGGAACTGC ATTGGGCAGG AGTTTGCCAT GATTGAGTTA 480 
AAGGTAACCA TTGCCTTGAT TCTGCTCCAC TTCAGAGTGA CTCCAGACCC CACCAGGCCT 540 
CTTACTTTCC CCAACCATTT TATCCTCAAG CCCAAGAATG GGATGTATTT GCACCTGAAG 600 
AAACTCTCTG AATGTTAGAT CTCAGGGTAC AATGATTAAA CGTACTTTGT TTTTCGAAGT 660 
TAAATTTACA GCTAATGATC CAAGCAGATA GAAAGGGATC AATGTATGGT GGGAGGATTG 720 

60 GAGGTTGGTG GGATAGGGGT CTCTGTGAAG AGATCCAAAA TCATTTCTAG GTACACAGTG 780 
TGTCAGCTAG ATCTGTTTCT ATATAACTTT GGGAGATTTT CAGATCTTTT CTGTTAAACT 840 
TTCACTACTA TTAATGCTGT ATACACCAAT AGACTTTCAT ATATTTTCTG TTGTTTTTAA 900 
AATAGTTTTC AGAATTATGC AAGTAATAAG TGCATGTATG CTCACTGTCA AAAATTCCCA 960 
ACACTAGAAA ATCATGTAG A ATAAAAATTT TAAATCTCAC TTCACTTAGC CGACATTCCA 1 020 

65 TGCCCTGACC AATCCTACTG CTTTTCCTAA AAACAGAATA ATTTGGTGTG CATTCTTTCA 1080 
GACTTTTTCC TATACATTTT ATATGTAGAA ATGTAGCAAT GTATTTGTAT AGATGTGATC 1 140 
ATTCCTATAT TGTTATTGAT TTTTTTCACT TAATAAAAAT TCACCTTATT CCTT 

Seq ID NO: 52 Protein sequence- 
70 Protein Accession #: XP 059098.1 

1 11 21 31 41 51 
I I I I I I 

MSYTTMCIKE TCRLIPAVPS ISRDLSKPLT FPDGCTLPAG ITVVLSIWGL HHNPAVWKNP 60 
75 KVFDPLRFSQ ENSDQRHPYA YLPFSAGSRN CIGQEFAMIE LKVTIALILL HFRVTPDPTR 120 
PLTFPNHFIL KPKNGMYLHL KKLSEC 
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Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession #: NM_030916 

Coding sequence: 1-1 533 (underlined sequences correspond to start and stop codons) 

5 1 11 21 31 41 51 

I I I I I I 

ATG CCCCTGT CCCTGGGAGC CGAGATGTGG GGGCCTGAGG CCTGGCTGCT GCTGCTGCTA 60 
CTGCTGGCAT CATTTACAGG CCGGTGCCCC GCGGGTGAGC TGGAGACCTC AGACGTGGTA 120 
ACTGTGGTGC TGGGCCAGG A CGCAAAACTG CCCTGCTTCT ACCG AGGGGA CTCCGGCGAG 1 80 
1 0 CAAGTGGGGC AAGTGGCATG GGCTCGGGTG GACGCGGGCG AAGGCGCCCA GGAACTAGCG 240 
CTACTGCACT CCAAATACGG GCTTCATGTG AGCCCGGCTT ACGAGGGCCG CGTGGAGCAG 300 
CCGCCGCCCC CACGCAACCC CCTGGACGGC TCAGTGCTCC TGCGCAACGC AGTGCAGGCG 360 
GATGAGGGCG AGTACGAGTG CCGGGTCAGC ACCTTCCCCG CCGGCAGCTT CCAGGCGCGG 420 
CTGCGGCTCC GAGTGCTGGT GCCTCCCCTG CCCTCACTGA ATCCTGGTCC AGCACTAGAA 480 
1 5 GAGGGCCAGG GCCTGACCCT GGCAGCCTCC TGCACAGCTG AGGGCAGCCC AGCCCCCAGC 540 
GTGACCTGGG ACACGGAGGT CAAAGGCACA ACGTCCAGCC GTTCCTTCAA GCACTCCCGC 600 
TCTGCTGCCG TCACCTCAGA GTTCCACTTG GTGCCTAGCC GCAGCATGAA TGGGCAGCCA 660 
Id: CTGACTTGTG TGGTGTCCCA TCCTGGCCTG CTCCAGGACC AAAGGATCAC CCACATCCTC 720 

L_. CACGTGTCCT TCCTTGCTGA GGCCTCTGTG AGGGGCCTTG AAGACCAAAA TCTGTGGCAC 780 

120 ATTGGCAGAG AAGGAGCTAT GCTCAAGTGC CTGAGTGAAG GGCAGCCCCC TCCCTCATAC 840 
?"-; AACTGGACAC GGCTGGATGG GCCTCTGCCC AGTGGGGTAC GAGTGGATGG GGACACTTTG 900 

r.~: GGCTTTCCCC CACTGACCAC TGAGCACAGC GGCATCTACG TCTGCCATGT CAGCAATGAG 960 

II I TTCTCCTCAA GGGATTCTCA GGTCACTGTG GATGTTCTTG ACCCCCAGGA AGACTCTGGG 1020 

f?l AAGCAGGTGG ACCTAGTGTC AGCCTCGGTG GTGGTGGTGG GTGTGATCGC CGCACTCTTG 1080 

325 TTCTGCCTTC TGGTGGTGGT GGTGGTGCTC ATGTCCCGAT ACCATCGGCG CAAGGCCCAG 1 140 
! U CAGATGACCC AGAAATATGA GGAGGAGCTG ACCCTGACCA GGGAGAACTC CATCCGGAGG 1200 

\. I CTGCATTCCC ATCACACGGA CCCCAGGAGC CAGCCGGAGG AGAGTGTAGG GCTGAGAGCC 1260 

GAGGGCCACC CTGATAGTCT CAAGGACAAC AGTAGCTGCT CTGTGATGAG TGAAGAGCCC 1320 
Cl GAGGGCCGCA GTTACTCCAC GCTGACCACG GTGAGGGAGA TAGAAACACA GACTGAACTG 1380 

30 CTGTCTCCAG GCTCTGGGCG GGCCGAGGAG GAGGAAGATC AGGATGAAGG CATCAAACAG 1440 
GCCATGAACC ATTTTGTTCA GGAGAATGGG ACCCTACGGG CCAAGCCCAC GGGCAATGGC 1500 
D ATCTACATCA ATGGGCGGGG ACACCTGGTC TGA 



'• Z _ Seq ID NO: 54 Protein sequence: 
fS5 Protein Accession #: NP 112178.1 

4=! 1 11 21 31 41 51 

fH I I I I I i 

IT! MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDVVTVVLGQDAKLPCFYRGDSGE 60 

WO QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120 
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180 
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCVVSHPGL LQDQRITHIL 240 
HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VVVGVIAALL 360 
45 FCLLVVVVVL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420 
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 480 
AMNHFVQENG TLRAKPTGNG IYINGRGHLV 

Seq ID NO: 55 DNA sequence 
5 0 Nucleic Acid Accession #: AF007 1 70. 1 

Coding sequence: 73-1725 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
,,.111111 

55 AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
CTGGACCAGT GC ATGA CCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 
AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 180 
CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 
ATGAAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 

60 TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360 
GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AACATGGTGA GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480 
GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 540 
GAAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600 

65 AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 
C AGCTGG AG G AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 
GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 900 

70 GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 
AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 
ATGTTTGGGA AGGAG GACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1 140 
GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 

75 CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCGAAACT CACGGATGGG 1320 
ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 
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TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 
TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1560 
GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 
TCCATGGAGT CAAGGACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1680 
CTAGAGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TG TAGC TTTG TGCAGCAGTT 1740 
CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800 
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1 860 
TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 
GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040 
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 
AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 
AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 
AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 
AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 
CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 

Seq ID NO: 56 Protein sequence: 
Protein Accession #: AAC39582. 1 

1 11 21 31 41 51 

I I I I I I 

MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEEIHAEVCY AECLLQRAAL TFLQDENMVS 120 
FIKGGKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRILR 1 80 
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVNIEEAEKL 240 
LKPYLNRYPK GAIFLFFAGR IEVIKGNIDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300 
TYKGQWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360 
KLKIAGKSLP TEKFAIRKSR RYFSSNPISL PVPALEMMYI WNGYAVIGKQ PKLTDGILEI 420 
ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRSISA NEKKIKYDHY 480 
LIPNALLELA LLLMEQDRNE EAKLLESAK QNYKNYSMES RTHFRIQAAT LQAKSSLENS 540 
SRSMVSSVSL 

Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession #: NM 006670. 1 

Coding sequence: 1 -927 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60 
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120 
TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 1 80 
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 
AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 
CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 
AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 
CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600 
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660 
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720 
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 
TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840 
CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGGATGTCCT CGAGTGA 

Seq ID NO: 58 Protein sequence 
Protein Accession #: NP_00666 1 . 1 



1 11 21 31 41 51 

I I I I I I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PfflRVFLDNN 1 80 
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240 
SYVFLGIVLA LIGAIFLLVL YLNRKGKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300 
SSNSDVLE 



Seq ID NO. 59 DNA sequence 

Nucleic Acid Accession #: NM_024022 
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Coding sequence: 1-1 362(underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 
5 ATGGGGGAAA ATGATCCGCC TGCTGTTGAA GCCCCCTTCT CATTCCGATC GCTTTTTGGC 60 
CTTGATGATT TGAAAATAAG TCCTGTTGCA CCAGATGCAG ATGCTGTTGC TGCACAGATC 120 
CTGTCACTGC TGCCATTG AA GTTTTTTCCA ATCATCGTCA TTGGG ATCAT TGCATTGATA 1 80 
TTAGCACTGG CCATTGGTCT GGGCATCCAC TTCGACTGCT CAGGGAAGTA CAGATGTCGC 240 
TCATCCTTTA AGTGTATCGA GCTGATAGCT CGATGTGACG GAGTCTCGGA TTGCAAAGAC 300 
1 0 GGGGAGGACG AGTACCGCTG TGTCCGGGTG GGTGGTCAGA ATGCCGTGCT CCAGGTGTTC 360 
ACAGCTGCTT CGTGGAAGAC CATGTGCTCC GATGACTGGA AGGGTCACTA CGCAAATGTT 420 
GCCTGTGCCC AACTGGGTTT CCCAAGCTAT GTGAGTTCAG ATAACCTCAG AGTGAGCTCG 480 
CTGGAGGGGC AGTTCCGGGA GGAGTTTGTG TCCATCGATC ACCTCTTGCC AGATGACAAG 540 
GTGACTGCAT TACACCACTC AGTATATGTG AGGGAGGGAT GTGCCTCTGG CCACGTGGTT 600 
1 5 ACCTTGCAGT GCACAGCCTG TGGTCATAGA AGGGGCTACA GCTCACGCAT CGTGGGTGGA 660 
AACATGTCCT TGCTCTCGCA GTGGCCCTGG CAGGCCAGCC TTCAGTTCCA GGGCTACCAC 720 
CTGTGCGGGG GCTCTGTCAT CACGCCCCTG TGGATCATCA CTGCTGCACA CTGTGTTTAT 780 

M GACTTGTACC TCCCCAAGTC ATGGACCATC CAGGTGGGTC TAGTTTCCCT GTTGGACAAT 840 

CCAGCCCCAT CCCACTTGGT GGAGAAGATT GTCTACCACA GCAAGTACAA GCCAAAGAGG 900 

*£0 CTGGGCAATG ACATCGCCCT TATGAAGCTG GCCGGGCCAC TCACGTTCAA TGAAATGATC 960 

0 CAGCCTGTGT GCCTGCCCAA CTCTGAAGAG AACTTCCCCG ATGG AAAAGT GTGCTGGACG 1 020 
j=: TCAGGATGGG GGGCCACAGA GGATGGAGGT GACGCCTCCC CTGTCCTGAA CCACGCGGCC 1080 
V i GTCCCTTTGA TTTCCAACAA GATCTGCAAC CACAGGGACG TGTACGGTGG CATCATCTCC 1 140 
ff S CCCTCCATGC TCTGCGCGGG CTACCTGACG GGTGGCGTGG ACAGCTGCCA GGGGGACAGC 1200 
525 GGGGGGCCCC TGGTGTGTCA AGAGAGGAGG CTGTGG AAGT TAGTGGGAGC G ACCAGCTTT 1 260 

1 i= ! GGCATCGGCT GCGCAGAGGT GAACAAGCCT GGGGTGTACA CCCGTGTCAC CTCCTTCCTG 1320 
'%| GACTGGATCC ACGAGCAGAT GGAGAGAGAC CTAAAAACCTGA 

W Seq ID NO: 60 Protein sequence 

,30 Protein Accession #: NP_076927 

D 1 11 21 31 41 51 

U I I I I I I 

' MGENDPPAVE APFSFRSLFG LDDLKJSPVA PDADAVAAQI LSLLPLKFFP HVIGIIALI 60 

fp5 LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 

* ~" TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 1 80 

W" VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240 

Pi LCGGSVITPL WIITAAHCVY DLYLPK.SWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 

S': LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360 

i 3$0 VPLISNKICN HRDVYGG1IS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420 
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 

Seq ID NO: 61 DNA sequence 
Nucleic Acid Accession ff: NM_006475 
45 Coding sequence: 28-2538 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

AACAGAACTG CAACGGAGAG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60 

5 0 TTGCTGCTTA TTGTTAACCC TATAAACGCC AACAATCATT ATGACAAGAT CTTGGCTCAT 120 
AGTCGTATCA GGGGTCGGGA CCAAGGCCC A AATGTCTGTG CCCTTCAACA GATTTTGGGC 1 80 
ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 
AAAACGACTG TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300 
TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCCACC 360 

5 5 ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AG ATCGAGGG AAAGGGATCC 420 
TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA TATCCGTAGA 480 
GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 
AAGAGAATGT TGACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600 
TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660 

60 ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTGACCG TGTGCTTACA 720 
CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 780 
GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 
TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACGAG GTGTCCTAGA AAGGTTCATG 900 
GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960 

65 TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
GGATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 
GTGACAAATA ATGGTGTG AT CCATTTG ATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 1 140 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 
GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260 

70 GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 1320 

CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 
ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 1500 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 

75 TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620 
ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA AAAAGAAATT 1680 
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 
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TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800 
AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA 1860 
TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 
5 CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040 

AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 
AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220 
AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280 
1 0 GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 

ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 
GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 
GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 
1 5 AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACATCAGC 2640 

ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2700 
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760 
j™ CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 

f~i AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 

20 CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 
Q TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000 

i *i TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 

- ! CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATQCA TATTTTTTAA 3120 

Hi TCTC AAACGT TTCAATAA AA CCATTTTTCA G ATATAAAG A G AATTACTTC AAATTGAGTA 3180 

jS5 ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

\,j Seq ID NO: 62 Protein sequence: 

j=" Protein Accession #: NP006466 

,30 1 11 21 31 41 51 

O MIPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60 

s . KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120 

= " KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 180 

[115 NGMHPSMYN NLGLFINHYP NGWTVNCAR IIHGNQIATN GWUVIDRVL TQIGTSIQDF 240 
'=•■■ IEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300 

~;!= ; MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360 

Pi IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 

VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480 
§ 40 RNGAIHIFRE IKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540 
FKGMTSEEKE ILIRDKNALQ NHLYHLTPG VFIGKGFEPG VTNILKTTQG SK1FLKEVND 600 
TLLVNELKSK ESDIMTTNGV IHVVDKLLYP ADTPVGNDQL LEILNKLKY IQIKFVRGST 660 
FKEIPVTVYT nOITKVVEP KUCVIEGSLQ PIKTEGPTL TKVK1EGEPE FRLIKEGETI 720 
TEVIHGEPH KKYTKIIDGV PVEITEKETR EERIITGPEI KYTRISTGGG ETEETLKKLL 780 
45 QEEVTK.VTKJ? IEGGDGHLFE DEEQCRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 

Seq ID NO: 63 DNA sequence 

Nucleic Acid Accession #: NMJD20974 

Coding sequence: 8 1 -3080 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

111!!! 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
CCGCAACCGC TGAGCCATCC ATG GGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 1 20 

5 5 CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 1 80 

CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT CGATGAATGT GGAAATGAGC 360 
TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 

60 TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGCCTGG 480 
AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 
GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 
GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 

65 TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 
ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 
ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 

70 TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGG AG GTT 1080 
GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1 140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 

75 GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGT GTCCCTGCAC TGCGGTAAG A GTGGTGG AGG AGACGGGTGC TTCCTC AGAT 1 500 
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GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560 
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 
5 TTATCACTGT TGAGTTTGAG CTTGAA ACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1 800 
TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC ACGCTCAGAA 1860 
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 
CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 
1 0 GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGG AAGG ACAAATGACT TGTGAACCAT 2 1 00 
GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 
GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 
1 5 GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460 
ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 
= = GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 

? "' AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640 

f20 TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700 
CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 
fe= ; CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820 

{J! TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 

GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940 
%l£5 TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 
fl ! AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 

[ " ' TTTTGAG ACC TTACAAATGA CTCAGCCC AC GTGCCACTCA ATACAAATGT TCTGCTATAG 3 1 20 

"~4 GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 

f = i CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 

~~30 GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 
3 CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 

TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
~~ : TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 

M- CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 

3=35 CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3600 

ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660 
_f!: AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 

3^ AGCACTTCTG GAGACAT 

40 Seq ID NO: 64 Protein sequence: 
3 "' Protein Accession #: NP_066025.1 

1 11 21 31 41 51 

I I I I I I 

45 MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 
LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTCIHR SEEGLSCMNK 180 
DHGCSfflCKE APRGSVACEC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 
PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 

50 STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 
SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 

5 5 FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 
TFQNEEGQMT CEPCPRPGNS G ALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 
KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840 
PPPKRRILIV VPEIFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 

60 IQFKSNEGNS ARGFQVPYVT YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLKALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 

Seq ID NO: 65 DNA sequence 
Nucleic Acid Accession #: NM_0072 1 0 

65 Coding sequence: 1-1869 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATG AGGCTCC TCCGCAGACG CCACATGCCC CTGCGCCTGG CCATGGTGGG CTGCGCCTTT 60 
70 GTGCTCTTCC TCTTCCTCCT GCATAGGGAT GTGAGCAGCA GAGAGGAGGC CACAGAGAAG 120 
CCGTGGCTGA AGTCCCTGGT GAGCCGGAAG GATCACGTCC TGGACCTCAT GCTGGAGGCC 1 80 
ATGAACAACC TTAGAGATTC AATGCCCAAG CTCCAAATCA GGGCTCCAGA AGCCCAGCAG 240 
ACTCTGTTCT CCATAAACCA GTCCTGCCTC CCTGGGTTCT ATACCCCAGC TGAACTGAAG 300 
CCCTTCTGGG AACGGCCACC ACAGGACCCC AATGCCCCTG GGGCAGATGG AAAAGCATTT 360 
75 CAGAAGAGCA AGTGGACCCC CCTGGAGACC CAGGAAAAGG AAGAAGGCTA TAAGAAGCAC 420 
TGTTTCAATG CCTTTGCCAG CGACCGGATC TCCCTGCAGA GGTCCCTGGG GCCAGACACC 480 
CGACCACCTG AGTGTGTGGA CCAGAAGTTC CGGCGCTGCC CCCCACTGGC CACCACCAGC 540 
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GTGATCATTG TGTTCCACAA CGAAGCCTGG TCCACACTGC TGCGAACAGT GTACAGCGTC 600 
CTACACACCA CCCCTGCCAT CTTGCTCAAG GAGATCATAC TGGTGGATGA TGCCAGCACA 660 
GAGGAGCACC TAAAGGAGAA GCTGGAGCAG TACGTGAAGC AGCTGCAGGT GGTGAGGGTG 720 
GTGCGGCAGG AGGAGCGGAA GGGGTTGATC ACCGCCCGGC TGCTGGGGGC CAGCGTGGCA 780 
5 CAGGCGGAGG TGCTCACGTT CCTGGATGCC CACTGTGAGT GCTTCCACGG CTGGCTGGAG 840 
CCCCTCCTGG CTCGAATCGC TGAGGACAAG ACAGTGGTGG TGAGCCCAGA CATCGTCACC 900 
ATCGACCTTA ATACTTTTGA GTTCGCCAAG CCCGTCCAGA GGGGCAGAGT CCATAGCCGA 960 
GGCAACTTTG ACTGGAGCCT GACCTTCGGC TGGGAAACAC TTCCTCCACA TGAGAAGCAG 1020 
AGGCGCAAGG ATGAAACATA CCCCATCAAA TCCCCGACGT TTGCTGGTGG CCTCTTCTCC 1080 
1 0 ATCCCCAAGT CCTACTTTGA GCACATCGGT ACCTATGATA ATCAGATGGA GATCTGGGGA 1 140 
GGGGAGAACG TGGAAATGTC CTTCCGGGTG TGGCAGTGTG GGGGCCAGCT GGAGATCATC 1200 
CCCTGCTCTG TCGTAGGCCA TGTGTTCCGG ACCAAGAGCC CCCACACCTT CCCCAAGGGC 1260 
ACTAGTGTCA TTGCTCGCAA TCAAGTGCGC CTGGCAGAGG TCTGGATGGA CAGCTACAAG 1320 
AAGATTTTCT ATAGGAGAAA TCTGCAGGCA GCAAAGATGG CCCAAGAGAA ATCCTTCGGT 1380 
1 5 GACATTTCGG AACGACTGCA GCTGAGGGAA CAACTGCACT GTCACAACTT TTCCTGGTAC 1440 
CTGCACAATG TCTACCCAGA GATGTTTGTT CCTGACCTGA CGCCCACCTT CTATGGTGCC 1 500 
ATCAAGAACC TCGGCACCAA CCAATGCCTG GATGTGGGTG AGAACAACCG CGGGGGGAAG 1560 
l4: CCCCTCATCA TGTACTCCTG CCACGGCCTT GGCGGCAACC AGTACTTTGA GTACACAACT 1620 

CAGAGGGACC TTCGCCACAA CATCGCAAAG CAGCTGTGTC TACATGTCAG CAAGGGTGCT 1680 
*20 CTGGGCCTTG GGAGCTGTCA CTTCACTGGC AAGAATAGCC AGGTCCCCAA GGACGAGGAA 1740 
f*l TGGGAATTGG CCCAGGATCA GCTCATCAGG AACTCAGGAT CTGGTACCTG CCTGACATCC 1 800 

r5 CAGGACAAAA AGCCAGCCAT GGCCCCCTGC AATCCCAGTG ACCCCCATCA GTTGTGGCTC 1 860 

|J ! TTTGT CTAGG ACCCAGATCA TCCCCAGAGA GAGCCCCCAC AAGCTCCTCA GGAAACAGGA 1920 

ff! TTGCTGATGT CTGGGAACCT GATCACCAGC TTCTCTGGAG GCCGTAAAGA TGGATTTCTA 1980 

-25 AACCCACTGG GTGGCAAGGC AGGACCTTCC TAATCCTTGC AACAACATTG GGCCCATTTT 2040 
I U CTTTCCTTCA CACCGATGGA AGAGACCATT AGGACATATA TTTAGCCTAG CGTTTTCCTG 2100 

i TTCTAGAAAT AGAGGCTCCC AAAGTAGGGA AGGCAGCTGG GGGAGGGTTC AGGGCAGCAA 21 60 

" :i TGCTGAGTTC AAGAAAAGTA CTTCAGGCTG GGCACAGTGG CTCATGCCTG AAATCCTAGC 2220 

!_'.! ACTTTGGGAA GACAATGTGG GAGAATGGCT TGAGCCCAGG AGTTCAAGAC CGGCCTGAGC 2280 

3 0 AACATAGTG A GGATCCCATC TCTACGCCCA CCCTCCCCCC GGCAAAAAAA AAAGCTGGGT 2340 
ATGGTGGCTT ATGCCTGTAG TCGCAGCTAC TCAGAAGGCT GAGGTGGGAG GATTGCTTGT 2400 
D TCCCCGGAGG TTGAAGCTAC AGTGAGCCTT GATTGTGTCA CTGCACTCCA GCCTGGGCAA 2460 

n CAGGTAAGAC TCTGTCTCAA AAAAAAAACA AAAAAGAAGA AGAAAAGTAC TTCTACAGCC 2520 

f =: ATGTCCTATT CCTTGATCAT CCAAAGCACC TGCAGAGTCC AGTGAAATGA TATATTCTGG 2580 

T&5 CTGGGCACAG TGGCTCACAC CTGTAATCCT AGCACTTTGG GAGGCCAAGG CAGGTGGATC 2640 
' ACCTGAGGTC AGAAGTTTGA AACCAGCCTG GACTACATGG TGAAACTCCA TCTCTACTAA 2700 

4« AAGTACAAAA ATTAGCTGGG CATGATGGCA CGCACCTGCA GTCCCAGCTA CTTGGGAGGC 2760 

TGAGGCAGGA GAATCACTCG AACCCAGGAG GCAGAGGTTG CAGTGAGCCA AGACAGCACC 2820 
T ~ : ATTGCACCCC AGCCTGAGCA ACAAGAGCGA AACTCCATCT CAGGAAAAAA AAAAAAAAAA 2880 

m a 

Seq ID NO: 66 Protein sequence: 
Protein Accession #: NP_009 1 4 1 

45 1 11 21 31 41 51 

I I I I I 1 

MRLLRRRHMP LRLAMVGCAF VLFLFLLHRD VSSREEATEK PWLKSLVSRK DHVLDLMLEA 60 
MNNLRDSMPK LQIRAPEAQQ TLFSINQSCL PGFYTPAELK PFWERPPQDP NAPGADGKAF 120 
QKSKWTPLET QEKEEGYKKH CFNAFASDRI SLQRSLGPDT RPPECVDQKF RRCPPLATTS 1 80 

5 0 VIIVFHNEAW STLLRTVYSV LHTTPAILLK EIILVDDAST EEHLKEKLEQ YVKQLQV VRV 240 
VRQEERKGLI TARLLGASVA QAEVLTFLDA HCECFHGWLE PLLAR1AEDK TVWSPDIVT 300 
IDLNTFEFAK PVQRGRVHSR GNFDWSLTFG WETLPPHEKQ RRKDETYPIK SPTFAGGLFS 360 
IPKSYFEHIG TYDNQMEIWG GENVEMSFRV WQCGGQLEII PCSVVGHVFR TKSPHTFPKG 420 
TSVIARNQVR LAEVWMDSYK KIFYRRNLQA AKMAQEKSFG DISERLQLRE QLHCHNFSWY 480 

5 5 LHNVYPEMFV PDLTPTFYGA DCNLGTNQCL DVGENNRGGK PLMYSCHGL GGNQYFEYTT 540 
QRDLRHN1AK QLCLHVSKGA LGLGSCHFTG KNSQVPKDEE WELAQDQLIR NSGSGTCLTS 600 
QDKKPAMAPC NPSDPHQLWL FV 

Seq ID NO: 67 DNA sequence 
60 Nucleic Acid Accession #: NM_014112 

Coding sequence: 600-4484 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
,,111111 

65 TTCCTCCGCG AAGGCTCCTT TGATATTAAT AGTGTTGGTG TCTTGAAACT GACGTAATGC 60 

GCGGAGACTG AGGTCCTGAC AAGCGATAAC ATTTCTGATA AAGACCCGAT CTTACTGCAA 120 
TCTCTAGCGT CCTCTTTTTT GGTGCTGCTG GTTTCTCCAG ACCTCGCGTC CTCTCGATTG 1 80 
CTCTCTCGCC TTCCTATTTC TTTTTTTTTT TTTTAAACAA AAAACAACAC CCCCTCCCCT 240 
CTCCCACCCG GCACCGGGCA CATCCTTGCT CTATTTCCTT TCTCTTTCTC TCTCTCTCTC 300 

70 TCTCTTTTTT AATAAGGGTG GGGGAGGGAA AGGGGGGGGA GGCAGGAAAG ACCTTTTTCT 360 
CTCCCCCCCG CAATAATCCA AGATCAACTC TGCAAACAAC AGAAGACGGT TCATGGCTTT 420 
GGCCGCCGCG CCACCATCTT TCGGGCTGCC GAGGGTGTTC TTGACGATTA ATCAACAGAT 480 
GTACAGATCA GCTCTCAAAA TGTCTTCTGT GTCTTCTGAG CGTCTTCTAA GACAATTGCA 540 
TTAGCCTCCT GCTAGTTGAC TAATAGAATT AATAATTGTA AAAAGCACTC TAAAGCCACA 600 

75 TGCCTTATGA AGTCAATGCT GGGTATGATT TTACAAATAT GGTCCGGAAA AAGAACCCCC 660 
CTCTGAGAAA CGTTGCAAGT GAAGGCGAGG GCCAGATCCT GGAGCCTATA GGTACAGAAA 720 
GCAAGGTATC TGGAAAGAAC AAAGAATTCT CTGCAGATCA GATGTCAGAA AATACGGATC 780 
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AGAGTGATGC TGCAGAACTA AATCATAAGG AGGAACATAG CTTGCATGTT CAAGATCCAT 840 
CTTCTAGCAG TAAGAAGGAC TTGAAAAGCG CAGTTCTGAG TGAGAAGGCT GGCTTCAATT 900 
ATGAAAGCCC CAGTAAGGGA GGAAACTTTC CCTCCTTTCC GCATGATGAG GTGACAGACA 960 
GAAATATGTT GGCTTTCTCA TTTCCAGCTG CTGGGGGAGT CTGTGAGCCC TTGAAGTCTC 1020 
5 CGCAAAGAGC AGAGGCAGAT GACCCTCAAG ATATGGCCTG CACCCCCTCA GGGGACTCAC 1080 
TGGAGACAAA GGAAGATCAG AAGATGTCAC CAAAGGCTAC AGAGGAAACA GGGCAAGCAC 1 140 
AGAGTGGTCA AGCCAATTGT CAAGGTTTGA GCCCAGTTTC AGTGGCCTCA AAAAACCCAC 1200 
AAGTGCCTTC AGATGGGGGT GTAAGACTGA ATAAATCCAA AACTGACTTA CTGGTGAATG 1260 
ACAACCCAGA CCCGGCACCT CTGTCTCCAG AGCTTCAGGA CTTTAAATGC AATATCTGTG 1 320 
1 0 GATATGGTTA CTACGGCAAC GACCCCACAG ATCTGATTAA GCACTTCCGA AAGTATCACT 1380 
TAGGACTGCA TAACCGCACC AGGCAAGATG CTGAGCTGGA CAGCAAAATC TTGGCCCTTC 1440 
ATAACATGGT GCAGTTCAGC CATTCCAAAG ACTTCCAGAA GGTCAACCGT TCTGTGTTTT 1500 
CTGGTGTGCT GCAGGACATC AATTCTTCAA GGCCTGTTTT ACTAAATGGG ACCTATGATG 1560 
TGCAGGTGAC TTCAGGTGGA ACATTCATTG GCATTGGACG GAAAACACCA GATTGCCAAG 1 620 
1 5 GGAACACC AA GTATTTCCGC TGTAAATTCT GCAATTTC AC TTATATGGGC AACTCATCCA 1 680 
CCGAATTAGA ACAACATTTT CTTCAGACTC ACCCAAACAA AATAAAAGCT TCTCTCCCCT 1740 
CCTCTGAGGT TGCAAAACCT TCAGAGAAAA ACTCTAACAA GTCCATCCCT GCACTTCAAT 1800 
N' CCAGTGATTC TGGAGACTTG GGAAAATGGC AGGACAAGAT AACAGTCAAA GCAGGAGATG 1 860 

f = i ACACTCCTGT TGGGTACTCA GTGCCCATAA AGCCCCTCGA TTCCTCTAGA CAAAATGGTA 1920 

=20 CAGAGGCCAC CAGTTACTAC TGGTGTAAAT TTTGTAGTTT CAGCTGTGAG TCATCTAGCT 1980 
Ci CACTTAAACT GCTAGAACAT TATGGCAAGC AGCACGGAGC AGTGCAGTCA GGCGGCCTTA 2040 

s w ATCCAGAGTT AAATGATAAG CTTTCCAGGG GCTCTGTCAT TAATCAGAAT GATCTAGCCA 2100 

M i AAAGTTCAGA AGGAGAGACA ATGACCAAGA CAGACAAGAG CTCGAGTGGG GCTAAAAAGA 2160 

tt i AGGACTTCTC CAGCAAGGGA GCCGAGGATA ATATGGTAAC GAGCTATAAT TGTCAGTTCT 2220 

GTGACTTCCG ATATTCCAAA AGCCATGGCC CTGATGTAAT TGTAGTGGGG CCACTTCTCC 2280 
I & GTCATTATCA ACAGCTCCAT AACATTCACA AGTGTACCAT TAAACACTGT CCATTCTGTC 2340 

K ! CCAG AGGACT TTGCAGCCCA GAAAAGCACC TTGGAGAAAT TACTTATCCG TTTGCTTGTA 2400 

GAAAAAGTAA TTGTTCCCAC TGTGCACTCT TGCTTCTGCA CTTGTCTCCT GGGGCGGCTG 2460 
W GAAGCTCGCG AGTCAAACAT CAGTGCCATC AGTGTTCATT CACCACCCCT GACGTAGATG 2520 

. 3 0 TACTCCTCTT TCACTATGAA AGTGTGCATG AGTCCCAAGC ATCGGATGTC AAACAAGAAG 2580 
.L CAAATCACCT GCAAGGATCG GATGGGCAGC AGTCTGTCAA GGAAAGCAAA GAACACTCAT 2640 

G GTACCAAATG TGATTTTATT ACCCAAGTGG AAGAAGAGAT TTCCCGACAC TACAGGAGAG 2700 

CACACAGCTG CTACAAATGC CGTCAGTGCA GTTTTACAGC TGCCGATACT CAGTCACTAC 2760 
r [ TGGAGCACTT CAACACTGTT CACTGCCAGG AACAGGACAT CACTACAGCC AACGGCGAAG 2820 

fB.5 AGGACGGTCA TGCCATATCC ACCATCAAAG AGGAGCCCAA AATTGACTTC AGGGTCTACA 2880 
V" ATCTGCTAAC TCCAGACTCT AAAATGGGAG AGCCAGTTTC TGAGAGTGTG GTGAAGAGAG 2940 

"W> AGAAGCTGGA AGAGAAGGAC GGGCTCAAAG AGAAAGTTTG GACCGAGAGT TCCAGTGATG 3000 

O ACCTTCGCAA TGTGACTTGG AGAGGGGCAG ACATCCTGCG GGGGAGTCCG TCATACACCC 3060 

~j n AAGCAAGCCT GGGGCTGCTG ACGCCTGTGT CTGGCACCCA AGAGCAGACA AAGACTCTAA 3120 
3 40 GGGATAGTCC CAATGTGGAG GCCGCCCATC TGGCGCGACC TATTTATGGC TTGGCTGTGG 31 80 
AAACCAAGGG ATTCCTGCAG GGGGCGCCAG CTGGCGGAGA GAAGTCTGGG GCCCTCCCCC 3240 
AGCAGTATCC TGCATCGGGA GAAAACAAGT CCAAGGATGA ATCCCAGTCC CTGTTACGGA 3300 
GGCGTAG AG G CTCCGGTGTT TTTTGTGCCA ATTGCCTGAC CACAAAGACC TCTCTCTGGC 3360 
GAAAGAATGC AAATGGCGGA TATGTATGCA ACGCGTGTGG CCTCTACCAG AAGCTTCACT 3420 
45 CGACTCCCAG GCCTTTAAAC ATCATTAAAC AAAACAACGG TGAGCAGATT ATTAGGAGGA 3480 
GAACAAGAAA GCGCCTTAAC CCAGAGGCAC TTCAGGCTGA GCAGCTCAAC AAACAGCAGA 3540 
GGGGCAGCAA TGAGGAGCAA GTCAATGGAA GCCCGTTAGA GAGGAGGTCA GAAGATCATC 3600 
TAACTGAAAG TCACCAGAGA GAAATTCCAC TCCCCAGCCT AAGTAAATAC GAAGCCCAGG 3660 
GTTCATTGAC TAAAAGCCAT TCTGCTCAGC AGCCAGTCCT GGTCAGCCAA ACTCTGGATA 3720 
50 TTCACAAAAG GATGCAACCT TTGCACATTC AGATAAAAAG TCCTCAGGAA AGTACTGGAG 3780 
ATCCAGGAAA TAGTTCATCC GTATCTG AAG GGAAAGGAAG TTCTGAGAGA GGCAGTCCTA 3840 
TAGAAAAGTA CATGAGACCT GCGAAACACC CAAATTATTC ACCACCAGGC AGCCCTATTG 3900 
AAAAGTACCA GTACCCACTT TTTGGACTTC CCTTTGTACA TAATGACTTC CAGAGTGAAG 3960 
CTGATTGGCT GCGGTTCTGG AGTAAATATA AGCTCTCCGT TCCTGGGAAT CCGCACTACT 4020 
5 5 TGAGTCACGT GCCTGGCCTA CCAAATCCTT GCCAAAACTA TGTGCCTTAT CCCACCTTCA 4080 
ATCTGCCTCC TCATTTTTCA GCTGTTGGAT CAGACAATGA CATTCCTCTA GATTTGGCGA 4140 
TCAAGCATTC CAGACCTGGG CCAACTGCAA ACGGTGCCTC CAAGGAGAAA ACGAAGGCAC 4200 
CACCAAATGT AAAAAATGAA GGTCCCTTGA ATGTAGTAAA AACAGAGAAA GTTGATAGAA 4260 
GTACTCAAGA TGAACTTTCA ACAAAATGTG TGCACTGTGG CATTGTCTTT CTGGATGAAG 4320 
60 TGATGTATGC TTTGCATATG AGTTGCCATG GTGACAGTGG ACCTTTCCAG TGCAGCATAT 4380 
GCCAGCATCT TTGCACGGAC AAATATGACT TCACAACACA TATCCAGAGG GGCCTGCATA 4440 
GGAACAATGC ACAAGTGGAA AAAAATGGAA AACCTAAAGA GTAAAACCTT AGCACTTAGC 4500 
ACAATTAAAT AGAAATAGGT TTTCTTGATG GGAATTCAAT AGCTTGTAAT GTCTTATGAA 4560 
GACCTATTAA AAAAATACTT CATAGAGCCT GCCTTATCCA ACATGAAATT CCCTTCTTTT 4620 
65 GTTATTCTTT CTTTTGATGA GTAGGTTACC AAGATTAAAA AGTGAGATAA ATGGTCAATG 4680 
AGAAAGAATG GAAGATGGTA AACAATCACT TTTTAAAACC TGTTAAGTCA AAACCATCTT 4740 
GGCTAATATG TACTGGGGAA ATAATCCATA AGAGATATCA CCAGACTAGA ATTAATATAT 4800 
TTATAAAGAA AGAGACCAAA ACTGTCTAGA ATTTGAAAGG GTTTACATAT TATTATACTA 4860 
AAGCAGTACT GGACTGGCCA TTGGACCATT TGTTCCAAAA CCCATAAATT GTTGCCTAAA 4920 
70 TTTATAATGA TCATGAAACC CTAGGCAGAG GAGGAGAAAT TGAAGGTCCA GGGCAATGAA 4980 
AGAAAAATGG CGCCCTCTCA ATTTAGTCTT CTCTCATTGG CCATGTTTCA GATTTTGACC 5040 
TAGAAATGCG AGCTGTGGTT AGGCTTGGTT AGAGTGCAGC AAGCAACATG ACAGATGGTG 5100 
GCACGCTGTT TTTACCCAGC CCTGCCTGTA CATACACATG CACACCCTCT CTGATATTTT 5160 
TGTCCTTTAG ATGTTCAAAT ACTCAGTAGT CCTTTTGTTT GCGGTTTAGA TTCATTTTGT 5220 
75 CCACACATGT ACCCATTTTA AAAAACAATG TCCTCGATGC TTCTGTAGTG ATTTCATTTT 5280 
AGCCAGGTAT TTCTTTCTTG TGTGTGATGA ACCAGTATGG ATTTGCTTTT CTAAGCCTCC 5340 
TGTTGGTTAC TAATCTCACT TGGCACATTA TAACTAAAGG AATCCCCTCA ATTCAAAAGC 5400 
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ATAGATGGAT ACAAATGTCA GACCGTGGGT TTAATTTGTT TAGAACACAT GGCATTTCTT 5460 
CACAAGGTAA CCTGCTGTAT TTATTTATTT TCTTTTGGTT AAATATAATT TCCAAACTTT 5520 
GTGGTCAGGC AGCGTCTAAG GTTACGTTAC CACAGACTGA CAGTTGGTAT ATGTACCAGC 5580 
CAATCCCTTC ATTAAATGTA TACAGATTTA GTTAAGTAGC ATTAAATAGG ATTCTTAGAA 5640 
5 GTATGTCCTC ATAGAACTTT TAATACTTAA GGCTTTGTAA AAACTATCCA TGAAGGGAAA 5700 
GCTCCTCAGC ATAACTGCTC AGGGAAATAG GGCTAAATAA CTGAACATTA AATAATTGGT 5760 
TAAAGGTGCT GTTAGTCGAG CCTCAATGCT TGCTACAAGG ATGTATGTAC AAGGACTGAC 5820 
TTTAATAATT TGCATTATAT TGTCCCAACC AGTAGTTTAT TTTTTGCCAC GGAGATGTAG 5880 
AAGATATTAC AAGCTACTGG ATGCACTGTC AGATTAACTT ATTTCATTAA AGAAGTTGGG 5940 
1 0 AGAACAAATA GGAAAAAAAA AACTTATTTT TCTAGTAAAT ATTAATGTAT TACATTTCAA 6000 
ATAATGGTGC CTGACATATT GAATAATTAT TTTCTACAGT GTACGTATGC AACAAAGATA 6060 
TTCCATCATG CATTAGAGTC AGTTCTGGCT CTGCCTAGCT GTTTACATTT GCAAATGTAG 6120 
CAAACAAGGT AATGAAGCAA CTATTTCTAT TGCAGTAGAT ATCCTTTTGT GTGTGTGTGT 6180 
GTGCATTAAA GTTGTAAACG GTAACATGAA ACAAATGAAA GTTCTTGCTA TAATGGTATG 6240 
1 5 GAAAACAAGA AGGAAATGAA AATATTTTTA TGCCTACTTA GGAAAAAAAG GGTAGCACTT 6300 
ATTCATTCCA AGTACTmT TTTTTTTAAT TTTTAAGCTC TTAACTCACA TTGTTATGCT 6360 
TAAGATGATA AACATATATC CTCTTTTTAT TGCTTTGTCT ATGTTTCATA TGAAACATTT 6420 
r- : CAGAAATTAT TTTGATAAGT GTTGCTGGAA TCTGCAACGC TGATTTTTTT TTGCATTCTG 6480 

f=- TAGTCGCATT TGCACTCCAT TTTTACATTA ATTCGCAGTT GCTTTGTATC ATTGTTTTGT 6540 

~'%0 TTGGGTTTTG TTTCTTTTTC ACAGTGCCGG GTCTTCGTTT CTTAAAGTTG GATGGCAGGT 6600 
C, \ AGAGTTCAAC CAGTTCGTGA CTGTTGTAGC GAATGAAGTT AAAAAAATGT CTTTCTGATG 6660 

I ~ i TTGTGTTGTC ATTTTCATTT TTGCATTTTT TTGTTTGCAT ATTAAAAAAA GAGAAAAGAG 6720 

y ' 1 AAAGCAAGAG ACAGAAATCA GGACTAAGTC CTCTGCTTCA GTTTCATTGT TAACGGGCCT 6780 

tB TATTCTGATC TCACCTGTCG CGTAGCTCTA ATATTCACAT AAACTGAAAT AAAGAAGTGG 6840 

J2i5 AATGAGGAGC TTTGACATTC AAATTATGTG ATGTAATTTA TCTTCCTTAG GAATTTTGAT 6900 
! -= [ GGATGCATCT CAAAATGTAT AGCCAGACTT GAGAGGTGAC AATTAAAGAT CTAAAAAAGA 6960 

%. j GAGGAGATTC CCCCAAACAA CAATATTTAA TTTTCTTAGT AAAAAGAATA ACAGAATGCA 7020 

TCGTGGCAAT CCTTAAGCAA CATTATCTAT GTGGACTGCT TAAATCAGCA AAACACCAGA 7080 
L= ! AGTTTGGTTA ACTTGGGCAA TATGACAAGT ATTACTTTTT GGGCAAAACT ACTCATTAAG 7140 

g 3 0 CAATTTCTCT AGTGTGTCGG ACACAAATAG GTTCTTTATT TTTGGCATGT ATGCCTTTTT 7200 
" r . ATTTTCATTC AATTTTTTTT TTTTCTCAGA CAGACATAGT AGTATCAACT AGCATTGGAA 7260 

AATACATATC ACTATTCTTG GAATATTTAT GGTCAGTCTA CTTTTTAGTA AAATATTTTT 7320 
3 < GGATAGCGTT GACACGATAG ATCTTATTCC ATACTTCTTT ATTATTGATA ATTTTATTTT 7380 

5 CATTTTTTGC TTTCATTATT ATACATATTT TGGTGGAGAA GAGGTTGGGC TTTTTTGAAA 7440 
rB5 GAGACAAAAA TTTATTATAA CACTAAACAC TCCTTTTTTG ACATATTAAA GCCTTTATTC 7500 
' s: CATCTCTCAA GATATATTAT AAAATTTATT TTTTTAATTT AAGATTTCTG AATTATTTTA 7560 

"f" TCTTAAATTG TGATTTTAAA CGAGCTATTA TGGTACGGAA CTTTTTTTAA TGAGGAATTT 7620 

Q CATGATGATT TAGGAATTTT CTCTCTTGGA AAAGGCTTCC CCTGTGATGA AAATGATGTG 7680 

~ ■. CCAGCTAAAA TTGTGTGCCA TTTAAAAACT GAAAATATTT TAAAATTATT TGTCTATATT 7740 

i #0 CTAAATTGAG CTTTGGATCA AACTTTAGGC CAGGACCAGC TCATGCGTTC TCATTCTTCC 7800 
TTTTCTCACT CTTTCTCTCA TCACTCACCT CTGTATTCAT TCTGTTGTTT GGGATAGAAA 7860 
AATCATAAAG AGCCAACCCA TCTCAGAACG TTGTGGATTG AGAGAGACAC TACATGACTC 7920 
CAAGTATATG AGAAAAGGAC AGAGCTCTAA TTGATAACTC TGTAGTTCAA AAGGAAAAGA 7980 
GTATGCCCAA TTCTCTCTAC ATGACATATT GAGATTTTTT TTAATCAACT TTTAAGATAG 8040 
45 TGATGTTCTG TTCTAAACTG TTCTGTTTTA GTGAAGGTAG ATTTTTATAA AACAAGCATG 8100 
GGGATTCTTT TCTAAGGTAA TATTAATGAG AAGGGAAAAA AGTATCTTTA ACAGCTCTTT 8160 
GTTGAAGCCT GTGGTAGCAC ATTATGTTTA TAATTGCACA TGTGCACATA ATCTATTATG 8220 
ATCCAATGCA AATACAGCTC CAAAAATATT AAATGTATAT ATATTTTAAA ATGCCTGAGG 8280 
AAATAC ATTT TTCTTAATAA ACTGAAGAGT CTCAGTATGG CTATTAAAAT AATTATTAGC 8340 
5 0 CTCCTGTTGT GTGGCTGCAA AACATCACAA AGTGACCGGT CTTGAG ACCT GTGAACTGCT 8400 
GCCCTGTTTA GTAAATAAAA TTAATGCATT TCTAGAGGGG GAATATCTGC CATCCAGTGG 8460 
TGGAAATGTG GAGTAAAGAA GCTGGTGGTC TGCTTCTGTG CTGTATGCCA GCCTTTTGCC 8520 
TTAAGTTGAG AGGA6GTCAA CTTTAGCTAC TGTCTTTGGT TTGAGAGCCA TGGCAAAAAA 8580 
AAAAAAAGAA AAAAAGATCA AGTCGTCTTT GGTGAGCCAG TAAGGTGAAA GCTTGCTGAC 8640 
5 5 TGTCCAAGGC ACAAG AGAAA ATTGAGGAAT TGAAATGCAA CCTGAGTATC AAACTAAATA 8700 
TTCTAATCAA AGGTAGGTAC TGTTAGGTGG AATTCTATCA GCAGGCAACT GCAAATGAGA 8760 
AGAAGATAGA AGGACGCCCG TCGGGACTTT GGAGGGCATT GTTATTTTCC CAAAGAAAGA 8820 
CGGCCAAGGG CAGAGGCATG GATTCTTTGC AGAGCACTTC CTTTTGGTTT TTCAGTACTG 8880 
TTTCATAGAC AGTGGGCTCA CATGTTCCTG ATAGTGCTGC AGTTGCTTAG AAAGCATCCC 8940 
60 AGTTAATTGC AGTAATTAGA ACTTCTGGAA TATGCTAGGG CAGAAGTATG TCAAGTATGT 9000 
CACATGAAGA AAATGTGAAA TTCAAGAGTA ATCCACACGT GAGAAACTAG ACAATGTACA 9060 
TTCATGTGTT CTCTTGAAAG GAAAGGGAGA GCTGTAAGCT TCACTCTGTC CTACACCGGA 9120 
GAAAAGCAGG AATAACTTTA CCGTGGAAAT AATGTTTAGC TTTTATCAGA GAAAATTGTC 9180 
CTTCTAGAGC ATAGAGTCCC AAAACTCAAT TCTGGTTTTC CCCTGTTTTT jjxxxxxxrx 92 4o 
65 TTTTTCCCAA CATATGAACT GCAGCATATC ACTTTTTCTT TTTGTGCCTC AGGTTCCTCA 9300 

CCTGTAAAAT TGAAAAATAT ATGTATTAAT AATATTATTA ATAATAATAA TGGTAATGTA 9360 
GTACTTGTTT GTAAAGCACT TTGAGATCCT TGGTTGAAAG GCACCATAGG AGTGCCAAGT 9420 
ATTATTATGT GGCCAAGGGG GTTATTTAAA CTGTCAGTTC CCAAAGGCCA GGAAAGGTTG 9480 
GGGTCATTTT TCTTAAAGAC GAGCTGTAAA TATCAACTAG GCAGCCAATA GTGTTGACTA 9540 
70 TGAAGATGCA AAACTATTAC TAGGCTGATA AAATCATAGT TTCTTAATGG CTACCAATAA 9600 
GGCAAATATC ACAATAATAA ACGCCAAATT CCTTAGGGCG GACTATTTGA CAACCACATG 9660 
GAAAACTTTG GGGGAGGCAT GAGGGGGGAA CATCTCAAAA TGCCAATGTA AAATTTAACT 9720 
TACAGCAATA TTCACCAGCA GAAAATGTCT TTCATATGGA ATGATTTCAT GTTGCTAAGA 9780 
AAAAGAATTC AATTTGTAGT CCTGATTTGA ATACTAGAAT GTTGGCTATA ATAGTTCTGT 9840 
75 TCTTACAACA CATGAAATTT TTTCGTTTTA TTTTATTTTG TTTTCATAGT GCATGTTCAT 9900 

TTCTACTCAC AAACATGTTC TTGGTGTATT TCTTATGCAA ACAATCTTCA GGCAGCAAAG 9960 
ATGTCTGTTA CATCTAAACT TGAATAATAA AGTTTTACCA CCAGTTACAC A 
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Seq ID NO: 68 Protein sequence: 
Protein Accession #: NP_05483 1 

5 1 11 21 31 41 51 
I I I I I I 

MPYEVNAGYD FTNMVRKKNP PLRNVASEGE GQILEPIGTE SKVSGKNKEF SADQMSENTD 60 
QSDAAELNHK EEHSLHVQDP SSSSKKDLKS AVLSEKAGFN YESPSKGGNF PSFPHDEVTD 120 
RNMLAFSFPA AGG VCEPLKS PQRAEADDPQ DMACTPSGDS LETKEDQKMS PKATEETGQA 1 80 
10 QSGQANCQGL SPVSVASKNP QVPSDGGVRL NKSKTDLLVN DNPDPAPLSP ELQDFKCNIC 240 
GYGYYGNDPT DLIKHFRKYH LGLHNRTRQD AELDSKILAL HNMVQFSHSK DFQKVNRSVF 300 
SGVLQDINSS RPVLLNGTYD VQVTSGGTFI GIGRKTPDCQ GNTKYFRCKF CNFTYMGNSS 360 
TELEQHFLQT HPNKIKASLP SSEVAKPSEK NSNKSIPALQ SSDSGDLGKW QDK1TVKAGD 420 
DTPVGYSVPI KPLDSSRQNG TEATSYYWCK FCSFSCESSS SLKLLEHYGK QHGAVQSGGL 480 
1 5 NPELNDKLSR GSVINQNDLA KSSEGETMTK TDKSSSGAKK KDFSSKGAED NMVTSYNCQF 540 
CDFRYSKSHG PDVIVVGPLL RHYQQLHNIH KCTKHCPFC PRGLCSPEKH LGEITYPFAC 600 
RKSNCSHCAL LLLHLSPGAA GSSRVKHQCH QCSFTTPDVD VLLFHYESVH ESQASDVKQE 660 
!,-=: ANHLQGSDGQ QSVKESKEHS CTKCDFITQV EEEISRHYRR AHSCYKCRQC SFTAADTQSL 720 

i=, LEHFNTVHCQ EQDITTANGE EDGHAISTDC EEPKIDFRVY NLLTPDSKMG EPVSESVVKR 780 

3=30 EKLEEKDGLK EKVWTESSSD DLRNVTWRGA DILRGSPSYT QASLGLLTPV SGTQEQTKTL 840 
Pi RDSPNVEAAH LARPIYGLAV ETKGFLQGAP AGGEKSGALP QQYPASGENK SKDESQSLLR 900 

fS: RRRGSGVFCA NCLTTKTSLW RKNANGGYVC NACGLYQKLH STPRPLNIIK QNNGEQIIRR 960 

U \ RTRKRLNPEA LQAEQLNKQQ RGSNEEQVNG SPLERRSEDH LTESHQREIP LPSLSKYEAQ 1020 

ff I GSLTKSHSAQ QPVLVSQTLD IHKRMQPLHI QIKSPQESTG DPGNSSSVSE GKGSSERGSP 1080 

225 IEKYMRPAKH PNYSPPGSPI EKYQYPLFGL PFVHNDFQSE ADWLRFWSKY KLSVPGNPHY 1140 
l lJ LSHVPGLPNP CQNYVPYPTF NLPPHFSAVG SDNDIPLDLA KHSRPGPTA NGASKEKTKA 1200 

%.. ! PPNVKNEGPL NVVKTEKVDR STQDELSTKC VHCGIVFLDE VMYALHMSCH GDSGPFQCSI 1260 

^ CQHLCTDKYD FTTHIQRGLH RNNAQVEKNG KPKE 

. 30 Seq ID NO: 69 DNA sequence 

t._ Nucleic Acid Accession*: XM_073879 

CP Coding sequence: 1 -387(underlined sequences correspond to start and stop codons) 

f^T 1 11 21 31 41 51 

f§5 i i i i i i 

ATG GGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG GAACGTCGAA AAAACCACCT 60 
4 s ' GAAGTGAAAA TGTTTGGAGC CAGTCAAGGT TTGCTGACAA TGGAAACAAA CCAGTCCCTG 120 

pi GCACAAGGCA CAGGCTGCTC AGTAGTAAAG GTAGACACTG TTCTCTTCG A GAGTTTATAC 1 80 

5r\ CACTGCGGCT TTGAACACGG GAGCGTGATG CACTGCCTTG GGGATGATCA CCCCCAGGAA 240 

I "40 GACAGGAAGG CTCACTTCTC TGCCCCAGTT GCAGCCATCG CCTCTCCAGC ACCGACTCCT 300 

GTCTGTCCTG CACACCACTC AACACAGAGC ATCTGCCAGT TTCTACAGCA CTGCAGGCAG 360 

AACACTCACT TGCAGGCTGC TAAC TAA 

Seq ID NO: 70 Protein sequence: 
45 Protein Accession #: XP_073879 

1 11 21 31 41 51 

I I I I I I 

MGFGDQGTVE GSLGTSKKPP EVKMFGASQG LLTMETNQSL AQGTGCSVVK VDTVLFESLY 60 
50 HCGFEHGSVM HCLGDDHPQE DRKAHFSAPV AAIASPAPTP VCPAHHSTQS ICQFLQHCRQ 120 
NTHLQAAN 

Seq ID NO: 71 DNA sequence 
Nucleic Acid Accession #: AB033064 
55 Coding sequence: 826-1986 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGGGAACAGC AAATTC AGTC ACAGACAATC CTCCACTCGG TCAAGAGCCA CTTTTCTCTT 60 

60 CCTGCCTTGC CCCCCCGCAG GGGGTAAGGA ACTGAGCGTT TAATCTTTAG CCGGTTGGCT 120 
ACCAGCTAAA ATTCTACTTA TCTTAGTTTC TAGTGGATAG CTTTCTTATT TTGCCCATGT 1 80 
TTTCTTAGAA TCCCTGTTTA ATATACTTTT GTCAGTAGTA GTATCTAGGA GTAGCAGGGA 240 
GAGTGACAAT AAATTAGCCC CTTCTTTTTT CCCTTGTCAT TCAGGCCCCT TTTCCTCTCC 300 
AGAGGGAAAT TACCAGTAAA CTCTTCTAAA TCTTCCACCC CTTCTCAGTC ATACTGTGAA 360 

65 GAAACACACT AAAGTGGACA TTATTTGACC AGTGAACACG AACCCAGCTT CAGGCATTGG 420 
TTTGTTGTGG CACATGGAGA AACATCTCTT TTAAAATATC TCCCAATTAC CCTTTTCACA 480 
ATTTGTATCC ACCTAGGATT TGCTGCTGGG GTAAGTCACT AGATTTATTT CTCAAAGCTC 540 
CCCTCTCTAT GAGCTGAAAG ACTGACCAAC CATGAACACT AGTAGGGGAT GGGGAAAGGG 600 
GACAGAGCAG AGCCAGTTGT TCCACACTTT GGGAAGCAGG AGTAGCTTTT ATCATCTTCC 660 

70 TCTGGGGAGC AGGCATAGAG ACATAAACTG AGTGAAAATG GGTGGAGGAA GAACTTCTAT 720 
ACCCACGAAC AACATGTGAA GAGAGAGAAC CAAACATAAA GTAAGGAGGG TAGACGTTAC 780 
ATCCAAGAGG AAATAATCCA GGCAAGGAAG CACAAGCTGA TCAAG ATGT G TAGTTCTGTG 840 
GCTGCCAAGT TGTGGTTTTT GACAGATCGT CGCATCAGGG AAGACTATCC TCAAAAAGAG 900 
ATTTTACGAG CATTGAAGGC CAAATGTTGT GAGGAGGAAC TGGACTTTAG GGCTGTGGTG 960 

7 5 ATGGATGAGG TGGTGCTGAC AATCGAGCAA GGAAACCTGG GTCTGCGGAT CA ATGGAGAG 1 020 
CTAATCACTG CCTACCCACA AGTGGTGGTA GTCAGAGTAC CAACCCCTTG GGTGCAAAGT 1080 
GATAGTGACA TCACTGTTTT GCGCCATCTA GAGAAGATGG GATGTCGGTT AATGAACCGA 1 140 
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CCTCAAGCCA TCCTGAACTG CGTTAATAAG TTCTGGACAT TTCAAGAGTT GGCTGGCCAT 1200 
GGTGTTCCTC TGCCGGATAC TTTCTCTTAT GGTGGCCACG AAAATTTTGC TAAAATGATT 1260 
GATGAGGCTG AAGTTCTGGA GTTCCCAATG GTAGTAAAGA ATACGCGGGG TCACAGAGGT 1320 
AAAGCTGTTT TCTTGGCTCG AGATAAGCAC CATTTGGCTG ATCTAAGCCA TCTTATTCGC 1380 
5 CATGAAGCGC CATACCTGTT CCAGAAGTAT GTTAAAGAGT CTCATGGACG GGATGTACGT 1440 
GTCATTGTCG TGGGAGGCCG TGTGGTTGGC ACCATGTTAC GTTGTTCAAC AGATGGGAGA 1500 
ATGCAAAGCA ACTGCTCATT AGGTGGTGTG GGGATGATGT GCTCATTGAG TGAACAAGGG 1560 
AAGCAGCTAG CTATCCAGGT GTCTAATATC CTGGGGATGG ATGTGTGTGG CATTGACCTG 1620 
CTGATGAAAG ATGACGGCTC CTTCTGCGTC TGTGAGGCCA ATGCAAATGT AGGTTTCATC 1680 
1 0 GCCTTTGATA AGGCTTGTAA TCTAGATGTA GCTGGTATCA TAGCAGACTA TGCCGCCTCC 1 740 
CTTCTACCCT CTGGCCGGCT CACCCGGCGT ATGTCCCTGC TCTCCGTGGT GTCCACTGCC 1 800 
AGTGAGACTA GTGAGCCGGA GCTGGGTCCC CCAGCCAGCA CTGCTGTTGA CAACATGAGT 1860 
GCAAGTTCCA GCTCTGTTGA CAGCGACCCT GAAAGCACGG AGCGAGAGCT GCTCACCAAG 1920 
CTCCCAGGGG GCCTGTTCAA CATGAACCAG CTGCTAGCCA ATGAAATCAA ACTACTGGTG 1980 
1 5 GAC TGAC TCC ACTGGTAATT AACCAACAAA ACCCTTGTAA AACTTTCTTT CTTCTTTTCT 2040 

ATTTTTAAAA CCAACTTGCA ATGCTGTTCA TGGAGGATGC TCAGGAAGAT GAGAGAAAAT 2100 
TAGTAGGATT AGTTGGAGAG AGTGGGAGAT AGATGAGACC TCTGCTAGTA AGATGTTACT 2160 
i -.. TTCATTTACA AATCCTACAA ATAGAGAGGC AGAATAGGTG GGGTATAGAA AAATGTCAGG 2220 

\ " CTCTCATAGT TACCCTTTTA AATTGCTAAA AAATGTGTAT GCTCATAGGC CATGAGGAAC 2280 

E2i0 AAATACTTTT TTTTTTTCAT GGTCCCTTGC TTTTGTTTTT GTACAAAAAA AAATGGTTTT 2340 
fm< GCTACAAATA TCCAAGTAGC ATAACTTCAC ATTGTGTTGG AAGATTTGTC ATCAGTGAGG 2400 

AAAACATCTG CTTAAATTAC AGGAATTTTT GTATTATACA GCTCTGAAAA TTCTGCCATT 2460 
[F! TCCTTATTAA CTAGCAGCTT TAGTTTGTAG TTTATGAAAT CTTGAGGGGC TCTTTTACTG 2520 

ij GGATTTCTTA TTTTTTTGTT TTTTCCCGCT TAATTTGGTG GGAGGTCAAA TTGAATATAA 2580 

IH5 CCCAATAAAG GCTTCTTAAT GACAAAATTG GCATGTTTGC ATGATGAAAT GGAAATGAAC 2640 
H I AGTATTGCAA TGTCCGGTAT ACAAAATAAC ATTAATTCAA TGTAGATAAA ATTACACTAG 2700 

I "~. TTTAAAATAT GTGCATTCAC TTGTATTTGT TAGTGTTTTA GTCTTTTTTG AAAGATGTGC 2760 

Si TCTGTTAATG TTGCTTTTTT TTTTTTTTTT TAATACATGC TAGTCTAACA TTTCCTGCTC 2820 

?=; TATGCCTGCA TCTTTAACAA TGGCCAAAGT GAAGAAAATG CTACCTTTTT TGTTAACAAG 2880 

% 3'0 ACACTGACTT GAAACATGTA CATTTAAAGC CTTTTATTTT TTCCCTTTTT GTTTTGGTAG 2940 
S TTGGGCATTT AAATAAGGAC AAGGAAAAAT ATTTTTGGGG GCAAATCAAG AGCCTATGAG 3000 

§t*, TTCTAAGTAT AAAGCTGAAG TGATTTCGAA TGCCAGCGTT ATATATTTGC ATTTTTCACA 3060 

TTTTACGAGG GAGTATATGT GTATGTGTGT GCACGCATGC ATGTGTATGT GTTTTGCTTT 3 120 
M= TTGTTTCCAT CAACTAATCA AAAAGGATAA TTTAGAAAAT GGAGCATGAT GGGAAACAGA 3180 

;=3j5 GTTTTTGACT TTAAAAAACA GATGAGTTGT TTTCATAAGT AGACTCCACT GGGGTAGAGG 3240 
I. V TATTCACCTT AAAACATAGG GTGAGTAGAT GCTTTTTTAG GCCTTTTTGT GTATATGTAC 3300 

_g! GTTGTTTGTT TTTTTCCTTT TGTTTCTAGC CTGTTCAGTG TACAGTTTAT TCAAGGCTAC 3360 

l~. ATGCTTTTCT TTAATGCTTC TGGCTATGCA TTTTCTCTTT TTACATATAG GATTTGGGAT 3420 

S=^ TGGGGGTGGG TTGGATGTTT TTGTTTGGGG ACTTATTTAG TAGTATTGAG TCTCTTATAG 3480 

tm) CCCTACTCTT AAGCCTTCAA TACTGTCCAC TCTTTATATT CCTTTACTTG CAGAATTTAT 3540 
; AAAAGCCCCC AAACTGCATA TAATATGAGC CTTTAAAACA TGGGTAAAAC TAATCCCATT 3600 

GATGGGTTTG GATGGTATGT TAAGAAATGG AGATGCTGCA GAGCCCAACG TAATTHTTA 3660 
AACAGCAAGT TTTCCATCTC CCTACGAATC CTCTGAAGCT TTTACCCAAG CCCTTTCTTG 3720 
CCTCTCCAGT GCTATTTTCC TTCAGATGGA CCTTAAACAT AATTTCTTGG ACACTACTAG 3780 
45 AGAGACTTCG AGGCAATAAT AAAAGATCAG TATTAACCAG CTATAACAGA GGTTTGATCA 3840 
TGCTTACTTG TACAGTTTTT CCCCCGTTTT AAAAAGGAAT GTAATAAAAT TTGTTTTTTC 3900 
CATAGAATTA AATAATATTA AAATTGAGTG AAAGGTTGAT TGTTGATGAA TAGAATAGTA 3960 
CCTCTCATCT GTGCAGTGTC TCATTTCACC TCAGAGAAAA GGATACATAA GAGGAGTTTG 4020 
TAATTTATCT TAGGATATTC TAATTGCATT TAAAAGAACT TATCTTGCGC AGGGTAAATG 4080 
50 GGGGACTCAC ATACATATAT TAATACCTCT GACTCATTAA CAGAAAGAAA TACTTGGTAC 4140 
TTCTTTCGCT GAATGACCAT ACTGTGGAGG ATGCATACTA TTTGGTATAG AGAAATAAAT 4200 
GAGGAAGAAA GAACTGCTTA ATTAAATTAT CATTCATATG TTCATATAGA GACCATCTGG 4260 
TTGCCATGTG TATTATGACA CATACACTTT GAATAGTTAC ATATCACAAG TATGTAGTTC 4320 
ATGTTTGTGT TGGTGGGGTA AGGCATCAGG AAAAATGTAG TTAGTCTTTT CTTAACTTAT 4380 
5 5 ACCAAATTAA CCAACTATAT TATAGGAAAT ATGTG AAATT AGTTCATTAG CTTTATTCAC 4440 
TATTATGCAT TCACATGATA TTAAAACGTA CACTCACATG TTAGAATGAA AAGAGCAGTA 4500 
GTTATCTTAG ATTTTAAAAA CATGGATATC TTCTTGAATT CCTTCAAGAT TGAGGTAGAG 4560 
AATAAGAGCA AATCATTCTG GAAGTACCTT AAGGAAACAA ACAGCAGCAG ATATTTAGGT 4620 
TAAACTTATT TTCATAATTG TTTAATAACT TTTGTATAAT CTTCATTGCT ATTATGAGAG 4680 
60 AGAATGTATA TATCAAATAT GTGTAATGAT AAAATCTGAA TTGTAAAATT TTTGTATATT 4740 
GTTAAAATTG TAATTCTAAA TTGTATTTCA AAAATGATTA TTTCTGATAT TGTTTTTATG 4800 
TCACCCATGA TGAAAACTGG ACTTTATATA TCTAAACATA CAAGTATGAA CTATTCTATT 4860 
TAAAATTTTT AATAGTTTTT TTCTTTTTTG GTGCCTATAA TTGATTGGTC ATTTCTGCTG 4920 
GCTTTTCTCC AATGAACATT GAAATCTTCC TGTATATGTT ACCAATAAGA AAACTACCCT 4980 
65 GGAACAGTAG AAAAACCCAA CAAGAGACTT GGCATTCATC AAGCACATTA TCAGACTTTG 5040 
AGAACATATT GAAGGCATTG ACTTTGAAAA TCATCTCTTT TTCTCAAGAA GAAAGCAATG 5100 
GAGAAGCAAA TTTGTTTCAT TCAGTGAATC CCCAGTTTGG GGCTTGTGGG GCTTAGAGAC 5160 
ATTGTGAAAT CAAATCTTGT GTTATACTTT TCTCCTGGCT CACTTTTTTT GAGAAGGTTT 5220 
ATGGGCTATT TGGCTGGTGA GACACGATCC CCTCCTAAGA AAATGTAGGT GCTCAGACAG 5280 
70 GTAACCACTG CTGCTACTGT TTTTATTTGT TTGTTTGTTC AATTTTATTT AAGATTTGTT 5340 
TTTGTTGTAC TAGGATTTTA AAAAATGTAA TATATTGCAG GATTTATAAC CAG 



75 



Seq ID NO: 72 Protein sequence: 
Protein Accession #: BAA86552 

1 11 21 31 41 51 

I I I I I I 
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MCSSVAAKLW FLTDRRIRED YPQKEILRAL KAKCCEEELD FRAVVMDEVV LTIEQGNLGL 60 
RTNGELITAY PQVVVVRVPT PWVQSDSDIT VLRHLEKMGC RLMNRPQAIL NCVNKFWTFQ 120 
ELAGHGVPLP DTFSYGGHEN FAKMIDEAEV LEFPMVVKNT RGHRGKAVFL ARDKHHLADL 1 80 
SHLIRHEAPY LFQKYVKESH GRDVRVIVVG GRVVGTMLRC STDGRMQSNC SLGGVGMMCS 240 
5 LSEQGKQLAI QVSNILGMDV CGIDLLMKDD GSFCVCEANA NVGFIAFDKA CNLDVAGIIA 300 
DYAASLLPSG RLTRRMSLLS VVSTASETSE PELGPPASTA VDNMSASSSS VDSDPESTER 360 
ELLTKLPGGL FNMNQLLANE IKLLVD 

Seq ID NO: 73 DNA sequence 
1 0 Nucleic Acid Accession #: XM_040080.2 

Coding sequence: 159-1 104 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
,,111111 

1 5 CTGAGTGGGG GCGGGGACTG CTGGAGTTGC GGGGCCTGCC TGGGGTAGGG CGGGGCAGGA 60 
CAGCTTGGAG ATAGGGCCCG GAATTGCGGG CGTCACTCTG CTCCTGCGAC CTAGCCAGGC 120 
GTGAGGG AGT GACAGCAGCG CATTCGCGGG ACGAGAGCG A TG AGTGAGAA CGCCGCACCA 1 80 
5 : == . GGTCTGATCT CAGAGCTGAA GCTGGCTGTG CCCTGGGGCC ACATCGCAGC CAAAGCCTGG 240 

Lg, GGCTCCCTGC AGGGCCCTCC AGTTCTCTGC CTGCACGGCT GGCTGGACAA TGCCAGCTCC 300 

W) TTCGACAGAC TCATCCCTCT TCTCCCGCAA GACTTTTATT ACGTTGCCAT GGATTTCGGA 360 
f =| GGTCATGGGC TCTCGTCCCA TTACAGCCCA GQTGTCCCAT ATTACCTCCA GACTTTTGTG 420 

fEI AGTGAGATCC GAAGAGTTGT GGCAGCCTTG AAATGGAATC GATTCTCCAT TCTGGGCCAC 480 

II ! AGCTTCGGTG GCGTCGTGGG CGGAATGTTT TTCTGTACCT TCCCCGAGAT GGTGGATAAA 540 

fr I CTTATCTTGC TGGACACGCC GCTCTTTCTC CTGGAATCAG ATGAAATGGA GAACTTGCTG 600 

:25 ACCTACAAGC GGAGAGCCAT AGAGCACGTG CTGCAGGTAG AGGCCTCCCA GGAGCCCTCG 660 
fy CACGTGTTCA GCCTGAAGCA GCTGCTG CAG AGGTTACTGA AGAGCAATAG CCACTTGAGT 720 

i GAGGAGTGCG GGGAGCTTCT CCTGCAAAGA GGAACCACGA AGGTGGCCAC AGGTCTGGTT 780 

'■' CTGAACAGAG ACCAGAGGCT CGCCTGGGCA GAGAACAGCA TTGACTTCAT CAGCAGGGAG 840 

O CTGTGTGCGC ATTCCATCAG GAAGCTGCAG GCCCATGTCC TGTTGATCAA AGCAGTCCAC 900 

"3 0 GGATATTTTG ATTCAAG AC A G AATTACTCT GAGAAGGAGT CCCTGTCGTT CATGATAGAC 960 

ACGATGAAAT CCACCCTCAA AGAGCAGTTC CAGTTTGTGG AAGTCCCAGG CAATCACTGT 1020 
Q GTCCACATGA GCGAACCCCA GCACGTGGCC AGTATCATCA GCTCCTTCTT ACAGTGCACA 1080 

• ' r CACATGCTCC CAGCCCAGCT G TAG CTCTGG GCCTGGAACT ATGAAG ACCT AGTGCTCCCA 1 140 

GACTCAACAC TGGGACTCTG AGTTCCTGAG CCCCACAACA AGGCCAGGGA TGGTGGGGAC 1200 
f35 AGGCCTCACT AGTCTTGAGG CCCAGCCTAG GATGGTAGTC AGGGGAAGGA GCGAGATTCC 1260 

AACTTCAACA TCTGTGACCT CAAGGGGGAG ACAGAGTCTG GGTTCCAGGG CTGCTTTCTC 1320 
#■• CTGGCTA ATA ATAAATATCC AGCCAGCTGG AGGAAGGAAG GGCAGGCTGG GCCCACCTAG 1380 

p. CCTTTCCCTG CTGCCCAACT GGATGGAAAA TAAAAGGTTC TTGTATTCTC A 

mo Seq ID NO: 74 Protein sequence: 

Protein Accession #: XP_O40080.1 

1. 11 21 31 41 51 

I I I I I I 

45 MSENAAPGLI SELKLAVPWG HIAAKAWGSL QGPPVLCLHG WLDNASSFDR LIPLLPQDFY 60 
YVAMDFGGHG LSSHYSPGVP YYLQTFVSEI RRVVAALKWN RFSILGHSFG GWGGMFFCT 120 
FPEMVDKLIL LDTPLFLLES DEMENLLTYK RRAIEHVLQV EASQEPSHVF SLKQLLQRLL 1 80 
KSNSHLSEEC GELLLQRGTT KVATGLVLNR DQRLAWAENS IDFISRELCA HSIRKLQAHV 240 
LLIKAVHGYF DSRQNYSEKE SLSFMIDTMK STLKEQFQFV EVPGNHCVHM SEPQHVASH 300 

5 0 SSFLQCTHML PAQL 

Seq ID NO: 75 DNA sequence 

Nucleic Acid Accession #: NM_005794 

Coding sequence: 434-1276 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGTTCCCTTC CACGCTGTGA AGCTTTGTTC TTTTGGTCTT CATGATAAAT CTTGCTGCTG 60 
CTCACTCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACCGCGA AGGTCTGCAA 120 

60 CTTCACTCCT GGGGCCAGCA AGACCACGAA TGCACCGAGA GGAATGAACA ACTCTGGACA 180 
CACCATCTTT AAGAACCGTA ATACTCACCG CAAGGGTCTG CAACTTCATT CTTGAAGTCA 240 
GTGAGGCCAA GAACCCATCA ATTCCGTACA CATTTTGGTG ACTTTGAAGA GACTGTCACC 300 
TATCACCAAG TGGTG AG ACT ATTGCCAAGC AGTGAGACTA TTGCCAAGTG GTGAGACCAT 360 
CACCAAGCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420 

65 GACACCAACC ACT ATGC TGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480 
TGCTAGGCTT TCTGTGAGGA TGAGCAGCAC CGGGATAGAC AGGAAGGGCG TCCTGGCTAA 540 
CCGGGTAGCC GTGGTCACGG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600 
GGCCCGGGAC GGGGCCCACG TGGTCATCAG CAGCCGGAAG CAGCAGAACG TGGACCGGGC 660 
CATGGCCAAG CTGCAGGGGG AGGGGCTGAG TGTGGCGGGC ATTGTGTGCC ACGTGGGGAA 720 

70 GGCTGAGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCGTCGACTT 780 
CCTGGTGTGC AGCG CAGGGG TCAACCCTCT GGTAGGGAGC ACTCTGGGGA CCAGTGAGCA 840 
GATCTGGGAC AAGATCCTAA GTGTGAACGT GAAGTCCCCA GCCCTGCTGC TGAGCCAGTT 900 
GCTGCCCTAC ATGGAGAACA GGAGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960 
TAATCCAGTA GTGGCGCTGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1 020 

75 TAGAACACTG GCATTGGAGC TGGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080 
AATTATAAAA ACTGACTTCA GCAAAGTGTT TCATGGGAAT GAGTCTCTCT GGAAGAACTT 1 140 
CAAGGAACAT CATCAGCTGC AGAGGATTGG GGAGTCAGAG GACTGTGCAG GAATCGTGTC 1200 
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CTTCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 1260 
CTCCACTCGG CTC TGA GAGG AGTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCCAGGAGC 1320 
CTGAGGGGGT GTCTAGGTGA TCATTTGGAT CTGGAGCAGA GTCTGCCATT CTGCCAGACT 1380 
AGCAATTTGG GGGCTTACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TCGGCATTCT 1440 
5 CC 

Seq ID NO: 76 Protein sequence: 
Protein Accession #: NP_005785 

10 1 11 21 31 41 51 

I I I I I I 

MLSAVARGYQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF AIARRLARDG 60 
AHVVISSRKQ QNVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 120 
AGVNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLP YM ENRRG AVILV SSI A AYNPVV 1 80 
1 5 ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCVVPGIKT DFSKVFHGNE SLWKNFKEHH 240 
QLQRIGESED CAGIVSFLCS PDASYVNGEN IAVAGYSTRL 

~ Seq ID NO: 77 DNA sequence 

120 Nucleic Acid Accession #: NM_002425 

f=l Coding sequence: 26-1453 (underlined sequences correspond to start and stop codons) 

01 1 11 21 31 41 51 

=25 AAAGAAGGTA AGGGCAGTGA GA ATG ATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 
Ifl i AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 

I '" • TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 1 80 

AAAGGACAGT AATCTCATTG TTAAAA AAAT CCAAGGAATG C AGAAGTTCC TTGGGTTGGA 240 
f=j GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 

"30 TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 
* TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 

f = i TG AG AAAGCT CTG AAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 

AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 
j=~ TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 

=35 TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 
CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 
=1! i TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 

p=, TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 

GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 
r?tO GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 
TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 
CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 
TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1 140 
AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 
45 CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 
TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 
GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1 380 
ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 
GTTACATTGC TAG GCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1 500 
50 ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 
GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 
ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 
ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 
CTT 



55 



Seq ID NO: 78 Protein sequence: 
Protein Accession #: NP 002416 



1 11 21 31 41 51 

60 i i i i i I 

MHLAFLVLLC LPVCSAYPLS GAAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60 
KIQGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPKWRKT HLTYRTVNYT 120 
PDLPRDA VDS AIEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 1 80 
HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240 
65 FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300 

RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360 
IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 
PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHILKSN SWLHC 

70 Seq ID NO: 79 DNA sequence 

Nucleic Acid Accession #: NM_000493. 1 

Coding sequence: 1-2043 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

75 i i i i i i 

ATGCTGCCAC AAATACCCTT TTTGCTGCTA GTATCCTTGA ACTTGGTTCA TGGAGTGTTT 60 
TACGCTGAAC GATACCAAAC GCCCACAGGC ATAAAAGGCC CACTACCCAA CACCAAGACA 1 20 

382 



C AGTTCTTCA TTCCCTACAC CATAAAGAGT AAAGGTATAG CAGTAAGAGG AGAGCAAGGT 1 80 
ACTCCTGGTC CACCAGGCCC TGCTGGACCT CGAGGGCACC CAGGTCCTTC TGGACCACCA 240 
GGAAAACCAG GCTACGGAAG TCCTGGACTC CAAGGAGAGC CAGGGTTGCC AGGACCACCG 300 
GGACCATCAG CTGTAGGGAA ACCAGGTGTG CCAGGACTCC CAGGAAAACC AGGAGAGAGA 360 
GGACCATATG GACCAAAAGG AGATGTTGGA CCAGCTGGCC TACCAGGACC CCGGGGCCCA 420 
CCAGGACCAC CTGGAATCCC TGGACCGGCT GGAATTTCTG TGCCAGGAAA ACCTGGACAA 480 
CAGGGACCCA CAGGAGCCCC AGGACCCAGG GGCTTTCCTG GAGAAAAGGG TGCACCAGGA 540 
GTCCCTGGTA TGAATGGACA GAAAGGGGAA ATGGGATATG GTGCTCCTGG TCGTCCAGGT 600 
GAGAGGGGTC TTCCAGGCCC TCAGGGTCCC ACAGGACCAT CTGGCCCTCC TGGAGTGGGA 660 
AAAAGAGGTG AAAATGGGGT TCCAGGACAG CCAGGCATCA AAGGTGATAG AGGTTTTCCG 720 
GGAGAAATGG GACCAATTGG CCCACCAGGT CCCCAAGGCC CTCCTGGGGA ACGAGGGCCA 780 
GAAGGCATTG GAAAGCCAGG AGCTGCTGGA GCCCCAGGCC AGCCAGGGAT TCCAGGAACA 840 
AAAGGTCTCC CTGGGGCTCC AGGAATAGCT GGGCCCCCAG GGCCTCCTGG CTTTGGGAAA 900 
CCAGGCTTGC CAGGCCTGAA GGGAGAAAGA GGACCTGCTG GCCTTCCTGG GGGTCCAGGT 960 
GCCAAAGGGG AACAAGGGCC AGCAGGTCTT CCTGGGAAGC CAGGTCTGAC TGGACCCCCT 1020 
GGGAATATGG GACCCCAAGG ACCAAAAGGC ATCCCGGGTA GCCATGGTCT CCCAGGCCCT 1080 
AAAGGTGAGA CAGGGCCAGC TGGGCCTGCA GGATACCCTG GGGCTAAGGG TGAAAGGGGT 1 140 
TCCCCTGGGT CAGATGGAAA ACCAGGGTAC CCAGGAAAAC CAGGTCTCGA TGGTCCTAAG 1200 
GGTAACCCAG GGTTACCAGG TCCAAAAGGT GATCCTGGAG TTGGAGGACC TCCTGGTCTC 1260 
CCAGGCCCTG TGGGCCCAGC AGGAGCAAAG GGAATGCCCG GACACAATGG AGAGGCTGGC 1320 
CCAAGAGGTG CCCCTGGAAT ACCAGGTACT AGAGGCCCTA TTGGGCCACC AGGCATTCCA 1380 
GGATTCCCTG GGTCTAAAGG GGATCCAGGA AGTCCCGGTC CTCCTGGCCC AGCTGGCATA 1440 
GCAACTAAGG GCCTCAATGG ACCCACCGGG CCACCAGGGC CTCCAGGTCC AAGAGGCCCC 1500 
TCTGGAGAGC CTGGTCTTCC AGGGCCCCCT GGGCCTCCAG GCCCACCAGG TCAAGCAGTC 1560 
ATGCCTGAGG GTTTTATAAA GGCAGGCCAA AGGCCCAGTC TTTCTGGGAC CCCTCTTGTT 1620 
AGTGCCAACC AGGGGGTAAC AGGAATGCCT GTGTCTGCTT TTACTGTTAT TCTCTCCAAA 1680 
GCTTACCCAG CAATAGGAAC TCCCATACCA TTTGATAAAA TTTTGTATAA CAGGCAACAG 1740 
CATTATGACC CAAGGACTGG AATCTTTACT TGTCAGATAC CAGGAATATA CTATTTTTCA 1800 
TACCACGTGC ATGTGAAAGG GACTCATGTT TGGGTAGGCC TGTATAAGAA TGGCACCCCT 1860 
GTAATGTACA CCTATGATGA ATACACCAAA GGCTACCTGG ATCAGGCTTC AGGGAGTGCC 1920 
ATCATCGATC TCACAGAAAA TGACCAGGTG TGGCTCCAGC TTCCCAATGC CGAGTCAAAT 1980 
GGCCTATACT CCTCTGAGTA TGTCCACTCC TCTTTCTCAG GATTCCTAGT GGCTCCAATG 2040 
TGA GTACACC CCACAGAGCT AATCTAAATC TTGTGCTAGA AAAAGCATTC TCTAACTCTA 2100 
CCCCACCCTA CAAAATGCAT ATGGAGGTAG GCTGAAAAGA ATGTAATTTT TATTTTCTGA 2160 
AATACAGATT TGAGCTATCA GACCAACAAA CCTTCCCCCT GAAAAGTGAG CAGCAACGTA 2220 
AAAACGTATG TGAAGCCTCT CTTGAATTTC TAGTTAGCAA TCTTAAGGCT CTTTAAGGTT 2280 
TTCTCCAATA TTAAAAAATA TCACCAAAGA AGTCCTGCTA TGTTAAAAAC AAACAACAAA 2340 
AAACAAAGCA ACAAAAAAAA AAATTAAAAA AAAAAACAGA AATAGAGCTC TAAGTTATGT 2400 
GAAATTTGAT TTGAGAAACT CGGCATTTCC TTTTTAAAAA AGCCTGTTTC TAACTATGAA 2460 
TATGAGAACT TCTAGGAAAC ATCCAGGAGG TATCATATAA CTTTGTAGAA CTTAAATACT 2520 
TGAATATTCA AATTTAAAAG ACACTGTATC CCCTAAAATA TTTCTGATGG TGCACTACTC 2580 
TGAGGCCTGT ATGGCCCCTT TCATCAATAT CTATTCAAAT ATACAGGTGC ATATATACTT 2640 
GTTAAAGCTC TTATATAAAA AAGCCCCAAA ATATTGAAGT TCATCTGAAA TGCAAGGTGC 2700 
TTTCATCAAT GAACCTTTTC AAAACTTTTC TATGATTGCA GAGAAGCTTT TTATATACCC 2760 
AGCATAACTT GGAAACAGGT ATCTGACCTA TTCTTATTTA GTTAACACAA GTGTGATTAA 2820 
TTTGATTTCT TTAATTCCTT ATTGAATCTT ATGTGATATG ATTTTCTGGA TTTACAGAAC 2880 
ATTAGCACAT GTACCTTGTG CCTCCCATTC AAGTGAAGTT ATAATTTACA CTGAGGGTTT 2940 
CAAAATTCGA CTAGAAGTGG AGATATATTA TTTATTTATG CACTGTACTG TATTTTTATA 3000 
TTGCTGTTTA AAACTTTTAA GCTGTGCCTC ACTTATTAAA GCACAAAATG TTTTACCTAC 3060 
TCCTTATTTA CGACACAATA AAATAACATC AATAGATTTT TAGGCTGAAT TAATTTGAAA 3120 
GCAGCAATTT GCTGTTCTCA ACCATTCTTT CAAGGCTTTT CATTCGACAC AATAAAATAA 3180 
CATCAATAGA TTTTTAGG 

Seq ID NO: 80 Protein sequence: 
Protein Accession #: NP_000484. 1 



1 11 21 31 41 51 

I I I I I I 

MLPQIPFLLL VSLNLVHGVF YAERYQTPTG KGPLPNTKT QFFIPYTIKS KGIAVRGEQG 60 
TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 
GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKG APG 1 80 
VPGMNGQK.GE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGQCGDRGFP 240 
GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 
PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360 
KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420 
PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 
ATKGLNGPTG PPGPPGPRGP SGEPGLPGPP GPPGPPGQAV MPEGFDCAGQ RPSLSGTPLV 540 
SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQ1PGIYYFS 600 
YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 81 DNA sequence 

Nucleic Acid Accession #: NM_000786 

Coding sequence: 332-1861 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
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1 I I I I I 

CGCGATTCTC AGGGATTGAT CCGCCTCTTC AGGTAAGTTA TCTTCCGGCC CCGTACCACT 60 
GTGCCACAGG CGCAGCCCGC TTCCTCAGGT GCCCTATCCC GCGCAGAAGA CCACGGCTTC 120 
ACAGAGTGTT ATTTAAGGGC GTGGCCAGCG GAACATCCCG CCCCATTCTG TGACGCACGG 1 80 
5 GGTGGCGCGC GTGGGACCCG AGGGGTGGGG CTGGGTTTAG TAGGAGACCT GGGGCAAGGC 240 
CCCCTGTGGA CGACCATCTG CCAGCTTCTC TCGTTCCGTC GATTGGGAGG AGCGGTGGCG 300 
ACCTCGGCCT TCAGTGTTTC CGACGGAGTG AATGGCGGCG GCGGCTGGGA TGCTGCTGCT 360 
GGGCTTGCTG CAGGCGGGTG GGTCGGTGCT GGGCCAGGCG ATGGAGAAGG TGACAGGCGG 420 
CAACCTCTTG TCCATGCTGC TGATCGCCTG CGCCTTCACC CTCAGCCTGG TCTACCTGAT 480 
1 0 CCGTCTGGCC GCCGGCCACC TGGTCCAGCT GCCCGCAGGG GTGAAAAGTC CTCCATACAT 540 
TTTCTCCCCA ATTCCATTCC TTGGGCATGC CATAGCATTT GGGAAAAGTC CAATTGAATT 600 
TCTAGAAAAT GCATATGAGA AGTATGGACC TGTATTTAGT TTTACCATGG TAGGCAAGAC 660 
ATTTACTTAC CTTCTGGGGA GTGATGCTGC TGCACTGCTT TTTAATAGTA AAAATGAAGA 720 
CCTGAATGCA GAAGATGTCT ACAGTCGCCT GACAACACCT GTGTTTGGGA AGGGAGTTGC 780 
15 ATACGATGTG CCTAATCCAG TTTTCTTGGA GCAGAAGAAA ATGTTAAAAA GTGGCCTTAA 840 
CATAGCCCAC TTTAAACAGC ATGTTTCTAT AATTGAAAAA GAAACAAAGG AATACTTTGA 900 
GAGTTGGGGA GAAAGTGGAG AAAAAAATGT GTTTGAAGCT CTTTCTGAGC TCATAATTTT 960 
L.i AACAGCTAGC CATTGTTTGC ATGGAAAGGA AATCAGAAGT CAACTCAATG AAAAGGTAGC 1020 

~1" ACAGCTGTAT GCAGATTTGG ATGGAGGTTT CAGCCATGCA GCCTGGCTCT TACCAGGTTG 1080 

00 GCTGCCTTTG CCTAGTTTCA GACGCAGGGA CAGAGCTCAT CGGGAAATCA AGGATATTTT 1 1 40 
CTATAAGGCA ATCCAGAAAC GCAGACAGTC TCAAGAAAAA ATTGATGACA TTCTCCAAAC 1200 

TZ. TTTACTAGAT GCTACATACA AGGATGGGCG TCCTTTGACT GATGATGAAG TAGCAGGGAT 1260 

HI GCTTATTGGA TTACTCTTGG CAGGGCAGCA TACATCCTCA ACTACTAGTG CTTGGATGGG 1 320 

CTTCTTTTTG GCCAGAGACA AAACACTTCA AAAAAAATGT TATTTAGAAC AGAAAACAGT 1380 
^25 CTGTGGAGAG AATCTGCCTC CTTTAACTTA TGACCAGCTC AAGGATCTAA ATTTACTTGA 1440 
ftj TCGCTGTATA AAAGAAACAT TAAGACTTAG ACCTCCTATA ATGATCATGA TGAGAATGGC 1500 

1 "■ CAGAACTCCT CAGACTGTGG CAGGGTATAC CATTCCTCCA GGACATCAGG TGTGTGTTTC 1560 
""-•i TCCCACTGTC AATCAAAGAC TTAAAGACTC ATGGGTAGAA CGCCTGGACT TTAATCCTGA 1620 
f=i TCGCTACTTA CAGGATAACC CAGCATCAGG GGAAAAGTTT GCCTATGTGC CATTTGGAGC 1680 
~3 0 TGGGCGTCAT CGTTGTATTG GGGAAAATTT TGCCTATGTT CAAATTAAG A CAATTTGGTC 1 740 
S CACTATGCTT CGTTTATATG AATTTGATCT CATTGATGGA TACTTTCCCA CTGTG AATTA 1 800 

TACAACTATG ATTCACACCC CTGAGAACCC AGTTATCCGT TACAAACGAA G ATCAAAATG 1 860 
fe AAAAAGGTTG CAAGGAACGA ATATATGTGA TTATCACTGT AAGCCACAAA GGCATTCGAA 1920 

M' GAGAATGAAG TGTACAAAAC AACTCTTGTA GTTTACTGTT TTTTTAAGTG TGTAATTCTA 1980 

fO, 5 A A AGCC AGTT TATGATTTAG GATTTTGTTA ACTG AATGGT TCTATCAAAT ATAATAGC AT 2040 
^ V- TTGACACATT TTCTAATAGT TATGATACTT ATACATGTGC TTTCAGGAAG TTCCTTGGTG 2100 

.JC: AAACAATTGT TGAGGGGGGA TCTAGGTAAT TGGCAGATTC TAAATAATAT AATTTCCAGA 2160 

~ B . TAGTAATTTT AAGAGTACTC ATCGCTCTTG CCAAATAAGT TCAGGGTATT CAAATCTTGG 2220 

ACTAGTCCTG CAAGGTATAA AGAATAAAAA TCCCAGTGAG ATACTTGGAA ACCACAGTTT 2280 
f 40 ATTATTATTT ATCTGGGCAA TTATTGTGTG TGTGAGGATG GAAGGGTAGG GAATAATC G A 2340 
5 = ACATCTAAAG CCTTGAATAA GAGAATACTA ATTGTTTTGG TATGATG ATA CTCAG AAATG 2400 

GAGATATTAT AGGAAAAAGA AATCCTTTGG AATTTTAACT AAAATCACTG CATATGGGAA 2460 
ATTAAGAGAT CCAGGACCAT ATTTGATAAG AGTTCCTAAA AATAATGTAA TTATTAATGC 2520 
TAAAGACTGC TCATGTATCT TGATCTAATT ACTAAATAAA TTACATATTT ATTTACCTGA 2580 
45 TAAATATGTA TCTAGTTCTA CAAGGTCACA TTTATGTGGA AGTCCAAAGT CAAGTCCTTA 2640 
GGGGATAATT TTGTTTTGGG CTCAGTTGTT CCCTGCTTCC TTTTTTTTTT TTTTTTTTTT 2700 
TTGAGATGGA GTCTCGCTCT GTTGCCCAGG CTGGAGTGCA GTGGTGCGAT CTCAGCTCAC 2760 
TGCATCCTCT GCCTCCCGGG TTCAAGCAAT TCTCTGCCTC AGCCTCCCAA GTAGTTGGGA 2820 
TTACAGGCAC CTGCCACCAT GCCTGGCTAA TTTTTTGTAT TTTTAGTAGA GACGGGGGTT 2880 
5 0 TCACTATGTT GGCTAGGCTG GTCTTG AACT CCTGAGCCTC GTGAGTCCAC CCGCCTTGGC 2940 
CTCCCAAAGT GCTGGGATTA CAGGCATGAG CCACCGCACC TGGCCTTCCC TGCTTCCTCT 3000 
CTAGAATCCA ATTAGGGATG TTTGTTACTA CTCATATTGA TTAAAACAGT TAACAAACTT 3060 
TTTTCTTTTT AAAATGTGAG ATCAGTGAAC TCTGGTTTTA AGATAATCTG AAACAAGGTC 3120 
CTTGGGAGTA ATAAAATTGG TCACATTCTG TAAAGCACAT TCTGTTTAGG AATCAACTTA 3180 
5 5 TCTCAAATTG TAACTCGGGG CCTAACTATA TGAGATGGCT GAAAA AAT AC CACATCGTCT 3240 
GTTTTCACTA GGTGATGCCA AAATATTTTG CTTTATGTAT ATTACAGTTC TTTTTAAAAC 3300 
ACTGGAAGAC TCATGTTAAA CTCTAATTGT GAAGGCAGAA TCTCTGCTAA TTTTTCAGAT 3360 
TAAAATTCTC TTTGAAAAAA T 

60 Seq ID NO: 82 Protein sequence 

Protein Accession #: NP000777 

1 11 21 31 41 51 

65 MAAAAGMLLL GLLQAGGSVL GQAMEKVTGG NLLSMLLIAC AFTLSLVYLI RLAAGHLVQL 60 
PAGVKSPPYI FSPIPFLGHA IAFGKSPIEF LENAYEKYGP VFSFTMVGKT FTYLLGSDAA 120 
ALLFNSKNED LNAEDVYSRL TTPVFGKGVA YDVPNPVFLE QKKMLKSGLN IAHFKQHVSI 180 
IEKETKEYFE SWGE SGEKNV FEALSELIIL TASHCLHGKE IRSQLNEKVA QLYADLDGGF 240 
SHAAWLLPGW LPLPSFRRRD RAHREIKDIF YKAIQKRRQS QEKIDDILQT LLDATYKDGR 300 

70 PLTDDEVAGM LIGLLLAGQH TSSTTSAWMG FFLARDKTLQ KKCYLEQKTV CGENLPPLTY 360 
DQLKDLNLLD RCIKETLRLR PPIMIMMRMA RTPQTVAGYT IPPGHQVCVS PTVNQRLKDS 420 
WVERLDFNPD RYLQDNPASG EKFAYVPFGA GRHRCIGENF AYVQKTIWS TMLRLYEFDL 480 
IDGYFPTVNY TTMIHTPENP VIRYKRRSK 

75 Seq ID NO: 83 DNA sequence 

Nucleic Acid Accession #■ NM_00655 1 .2 

Coding sequence: 64-336 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 51 

I I I I I I 

AATTCTAGAA GTCCAAATCA CTCATTGTTT GTGAAAGCTG AGCTCACAGC AAAACAAGCC 60 
ACC ATG AAGC TGTCGGTGTG TCTCCTGCTG GTCACGCTGG CCCTCTGCTG CTACC AGGCC 1 20 
AATGCCGAGT TCTGCCCAGC TCTTGTTTCT GAGCTGTTAG ACTTCTTCTT CATTAGTG AA 1 80 
CCTCTGTTCA AGTTAAGTCT TGCCAAATTT GATGCCCCTC CGGAAGCTGT TGCAGCCAAG 240 
TTAGGAGTGA AGAGATGCAC GGATCAGATG TCCCTTCAGA AACGAAGCCT CATTGCGGAA 300 
GTCCTGGTGA AAATATTGAA GAAATGTAGT GTG TGA CATG TAAAAACTTT CATCCTGGTT 360 
TCCACTGTCT TTCAATGACA CCCTGATCTT CACTGCAGAA TGTAAAGGTT TCAACGTCTT 420 
GCTTTAATAA ATCACTTGCT CTAC 

Seq ID NO: 84 Protein sequence: 
Protein Accession #: NP 006542. 1 

1 11 21 31 41 51 
I I I 1 I I 

MKLSVCLLLV TLALCCYQAN AEFCPALVSE LLDFFFISEP LFKLSLAKFD APPEAVAAKL 60 
GVKRCTDQMS LQKRSLIAEV LVKILKKCSV 

Seq ID NO: 85 DNA sequence 

Nucleic Acid Accession #: NM_0023 17.1 

Coding sequence: 231-1484 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGGCCAGGAC TGAGAAAGGG GAAAGGGAAG GGTGCCACGT CCGAGCAGCC GCCTTGACTG 60 
GGGAAGGGTC TGAATCCCAC CCTTGGCATT GCTTGGTGGA GACTGAGATA CCCGTGCTCC 120 
GCTCGCCTCC TTGGTTG AAG ATTTCTCCTT CCCTCACGTG ATTTG AGCCC CGTTTTTATT 1 80 
TTCTGTGAGC CACGTCCTCC TCGAGCGGGG TCAATCTGGC AAAAGGAGTG ATG CGCTTCG 240 
CCTGGACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC TGCGCCCCTC 300 
CCGCCGCCGG CCAACAGCAG CCCCCGCGCG AGCCGCCGGC GGCTCCGGGC GCCTGGCGCC 360 
AGCAGATCCA ATGGGAGAAC AACGGGCAGG TGTTCAGCTT GCTGAGCCTG GGCTCACAGT 420 
ACCAGCCTCA GCGCCGCCGG GACCCGGGCG CCGCCGTCCC TGGTGCAGCC AACGCCTCCG 480 
CCCAGCAGCC CCGCACTCCG ATCCTGCTGA TCCGCGACAA CCGCACCGCC GCGGGGCGAA 540 
CGCGGACGGC CGGCTCATCT GGAGTCACCG CTGGCCGCCC CAGGCCCACC GCCCGTCACT 600 
GGTTCCAAGC TGGCTACTCG ACATCTAGAG CCCGCGAAGC TGGGCCCTCG CGCGCGGAGA 660 
ACCAGACAGC GCCGGGAGAA GTTCCTGCTC TCAGTAACCT GCGGCCGCCC AGCCGCGTGG 720 
ACGGCATGGT GGGCGACGAC CCTTACAACC CCTACAAGTA CTCTGACGAC AACCCTTATT 780 
ACAACTACTA CGATACTTAT GAAAGGCCCA GACCTGGGGG CAGGTACCGG CCCGGATACG 840 
GCACTGGCTA CTTCCAGTAC GGTCTCCCAG ACCTGGTGGC CGACCCCTAC TACATCCAGG 900 
CGTCCACGTA CGTGCAGAAG ATGTCCATGT ACAACCTGAG ATGCGCGGCG GAGGAAAACT 960 
GTCTGGCCAG TACAGCATAC AGGGCAGATG TCAGAGATTA TGATCACAGG GTGCTGCTCA 1020 
GATTTCCCCA AAGAGTGAAA AACCAAGGGA CATCAGATTT CTTACCCAGC CGACCAAGAT 1080 
ATTCCTGGGA ATGGCACAGT TGTCATCAAC ATTACCACAG TATGGATGAG TTTAGCCACT 1 140 
TGTACCTGCT TGATGCCAAC ACCCAGAGGA GATGGGCTGA AGGCCACAAA GCAAGTTTCT 1200 
GTCTTGAAGA CACATCCTGT GACTATGGCT ACCACAGGCG ATTTGCATGT ACTGCACACA 1260 
CACAGGGATT GAGTCCTGGC TGTTATGATA CCTATGGTGC AGACATAGAC TGCCAGTGGA 1320 
TTGATATTAC AGATGTAAAA CCTGGAAACT ATATCCTAAA GGTCAGTGTA AACCCCAGCT 1380 
ACCTGGTTCC TGAATCTGAC TATACCAACA ATGTTGTGCG CTGTGACATT CGCTACACAG 1440 
GACATCATGC GTATGCCTCA GGCTGCACAA TTTCACCGTA T TAGA AGGCA AAGCAAAACT 1500 
CCCAATGGAT AAATCAGTGC CTGGTGTTCT GAAGTGGGAA AAAATAGACT AACTTCAGTA 1560 
GGATTTATGT ATTTTGAAAA AGAGAACAGA AAACAACAAA AGAATTTTTG TTTGGACTGT 1620 
TTTCAATAAC AAAGCACATA ACTGGATTTT GAACGCTTAA GTCATCATTA CTTGGGAAAT 1680 
TTTTAATGTT TATTATTTAC ATCACTTTGT GAATTAACAC AGTGTTTCAA TTCTGTAATT 1740 
ACATATTTGA CTCTTTCAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 86 Protein sequence: 
Protein Accession #: NP 002308. 1 

1 11 21 31 41 51 
I I I I I I 

MRFAWTVLLL GPLQLCALVH CAPPAAGQQQ PPREPPAAPG AWRQQIQWEN NGQVFSLLSL 60 
GSQYQPQRRR DPG AAVPGAA NASAQQPRTP ILLIRDNRTA AGRTRTAGSS GVTAGRPRPT 1 20 
ARHWFQAGYS TSRAREAGPS RAENQTAPGE VPALSNLRPP SRVDGMVGDD PYNPYKYSDD 1 80 
NPYYNYYDTY ERPRPGGRYR PGYGTGYFQY GLPDLVADPY YIQASTYVQK MSMYNLRCAA 240 
EENCLASTAY RADVRDYDHR VLLRFPQRVK NQGTSDFLPS RPRYSWEWHS CHQHYHSMDE 300 
FSHLYLLDAN TQRRWAEGHK ASFCLEDTSC DYGYHRRFAC TAHTQGLSPG CYDTYGADID 360 
CQWIDITDVK PGNYILKVSV NPSYLVPESD YTNNVVRCDI RYTGHHAYAS GCTISPY 

Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession #: NM_006419. 1 

Coding sequence: 91-420 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I 1 I I 

TTCGGCACTT GGGAGAAGAT GTTTGAAAAA ACTGACTCTG CTAATGAGCC TGGACTCAGA 60 
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GCTCAAGTCT GAACTCTACC TCCAGACAGA ATGAAGTTCA TCTCGACATC TCTGCTTCTC 120 
ATGCTGCTGG TCAGCAGCCT CTCTCCAGTC CAAGGTGTTC TGGAGGTCTA TTACACAAGC 1 80 
TTGAGGTGTA GATGTGTCCA AGAGAGCTCA GTCTTTATCC CTAGACGCTT CATTGATCGA 240 
ATTCAAATCT TGCCCCGTGG GAATGGTTGT CCAAGAAAAG AAATCATAGT CTGGAAGAAG 300 
5 AACAAGTCAA TTGTGTGTGT GGACCCTCAA GCTGAATGGA TACAAAGAAT GATGGAAGTA 360 
TTGAGAAAAA GAAGTTCTTC AACTCTACCA GTTCCAGTGT TTAAGAGAAA GATTCCCTGA 420 
TGCTGATATT TCCACTAAGA ACACCTGCAT TCTTCCCTTA TCCCTGCTCT GGATTTTAGT 480 
TTTGTGCTTA GTTAAATCTT TTCCAGGGAG AAAGAACTTC CCCATACAAA TAAGGCATGA 540 
GGACTATGTG AAAAATAACC TTGCAGGAGC TGATGGGGCA AACTCAAGCT TCTTCACTCA 600 
10 CAGCACCCTA TATACACTTG GAGTTTGCAT TCTTATTCAT CAGGGAGGAA AGTTTCTTTG 660 
AAAATAGTTA TTCAGTTATA AGTAATACAG GATTATTTTG ATTATATACT TGTTGTTTAA 720 
TGTTTAAAAT TTCTTAGAAA ACAATGGAAT GAGAATTTAA GCCTCAAATT TGAACATGTG 780 
GCTTGAATTA AGAAGAAAAT TATGGCATAT ATTAAAAGCA GGCTTCTATG AAAGACTCAA 840 
AAAGCTGCCT GGGAGGCAGA TGGAACTTGA GCCTGTCAAG AGGCAAAGGA ATCCATGTAG 900 
15 TAGATATCCT CTGCTTAAAA ACTCACTACG GAGGAGAATT AAGTCCTACT TTTAAAGAAT 960 
TTCTTTATAA AATTTACTGT CTAAGATTAA TAGCATTCGA AGATCCCCAG ACTTCATAGA 1020 
ATACTC A GGG AAAGCATTTA AAGGGTGATG TACACATGTA TCCTTTCACA CATTTGCCTT 1080 

M< GACAAACTTC TTTCACTCAC ATCTTTTTCA CTGACTTTTT TTGTGGGGGC GGGGCCGGGG 1 140 

GGACTCTGGT ATCTAATTCT TTAATGATTC CTATAAATCT AATGACATTC AATAAAGTTG 1200 

^j0 AGCAAACATT TTACTT 

W 

=; Seq ID NO: 88 Protein sequence- 

Ul Protein Accession #: NP_006410.1 

ZZ5 1 11 21 31 41 51 

ru i i i i i i 

i. i MKFISTSLLL MLLVSSLSPV QGVLEVYYTS LRCRCVQESS VFIPRRFIDR IQILPRGNGC 60 

y PRKEIIVWKK NKSIVCVDPQ AEWIQRMMEV LRKRSSSTLP VPVFKRKIP 

10 Seq ID NO: 89 DNA sequence 

Nucleic Acid Accession #• NM_002652 
Ci Coding sequence: 37-477 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

fB5 i i i i i I 

-I • CTTCTCTGGG ACACATTGCC TTCTGTTTTC TCCAGCATGC GCTTGCTCCA GCTCCTGTTC 60 

AGGGCCAGCC CTGCCACCCT GCTCCTGGTT CTCTGCCTGC AGTTGGGGGC CAACAAAGCT 1 20 
m CAGGACAACA CTCGGAAGAT CATAATAAAG AATTTTGACA TTCCCAAGTC AGTACGTCCA 1 80 

%s=' AATGACGAAG TCACTGCAGT GCTTGCAGTT CAAACAGAAT TGAAAGAATG CATGGTGGTT 240 

040 AAAACTTACC TCATTAGCAG CATCCCTCTA CAAGGTGCAT TTAACTATAA GTATACTGCC 300 
TGCCTATGTG ACGACAATCC AAAAACCTTC TACTGGGACT TTTACACCAA CAGAACTGTG 360 
CAAATTGCAG CCGTCGTTGA TGTTATTCGG GAATTAGGCA TCTGCCCTGA TGATGCTGCT 420 
GTAATCCCCA TCAAAAACAA CCGGTTTTAT ACTATTGAAA TCCTAAAGGT AGAATAATGG 480 
AAGCCCTGTC TGTTTGCCAC ACCCAGGTGA TTTCCTCTAA AGAAACTTGG CTGGAATTTC 540 
45 TGCTGTGGTC TATAAAATAA ACTTCTTAAC ATGCTT 



Seq ID NO: 90 Protein sequence: 
Protein Accession #: NP_002643.1 

50 1 11 21 31 41 51 

MRLLQLLFRA SPATLLLVLC LQLGANKAQD NTRKIIIKNF DIPKSVRPND EVTAVLAVQT 60 
ELKECMVVKT YLISSIPLQG AFNYKYTACL CDDNPKTFYW DFYTNRTVQI AAVVDVIREL 120 
^ GICPDDAAVI PKNNRFYTI EILKV 

Seq ID NO: 91 DNA sequence 

Nucleic Acid Accession #: AK000341 

Coding sequence: 85-975 (underlined sequences correspond to start and stop codons) 
60 1 11 21 31 41 51 

gata'gcgccg ggcagaggga CCCGGCTACC CTGGACAGCG CATCGCCGCC CGCCCGGGTC 60 
GCCGCGCCAC AGCCGCTGCG GATC ATGG AA CATCTAAAGG CCTTTGATGA TGAAATCAAT 120 
GCTTTTTTGG ACAATATGTT TGGACCGCGA GATTCTCGAG TCAGAGGGTG GTTCACGTTG 1 80 

65 GACTCTTACC TTCCTACCTT TTTTCTTACT GTCATGTATC TGCTCTCAAT ATGGCTGGGT 240 

AACAAGTATA TGAAGAACAG ACCTGCTCTT TCTCTCAGGG GTATCCTCAC CTTGTATAAT 300 
CTTGGAATCA CACTTCTCTC CGCGTACATG CTGGCAGAGC TCATTCTCTC CACTTGGGAA 360 
GGAGGCTACA ACTTACAGTG TCAAGATCTT ACCAGCGCAG GGGAAGCTGA CATCCGGGTA 420 
GCCAAGGTGC TTTGGTGGTA CTATTTCTCC AAATCAGTAG AGTTCCTGGA CACAATTTTC 480 

70 TTCGTTTTGC GGAAAAAAAC GAGTCAGATT ACTTTTCTTC ATGTATATCA TCATGCTTCT 540 
ATGTTTAACA TCTGGTGGTG TGTCTTGAAC TGGATACCTT GTGGACAAAG TTTCTTTGGA 600 
CCAACACTGA ACAGTTTTGT CCACATTCTT ATGTACTCCT ACTATGGACT TTCTGTGTTT 660 
CCATCTATGC ACAAGTATCT TTGGTGGAAG AAATATCTCA CACAGGCTCA GCTGGTGCAG 720 
TTCGTGCTCA CCATCACGCA CACCATGAGC GCCGTCGTGA AACCGTGTGG CTTCCCCTTC 780 

75 GGTTGTCTCA TCTTCCAGTC ATCTTATATG CTAACGTTAG TCATCCTCTT CTTAAATTTT 840 

TATGTTCAGA CATACCGAAA AAAGCCAATG AAGAAAGATA TGCAAGAGCC ACCTGCAGGG 900 
AAAGAAGTGA AGAATGGTTT TTCCAAAGCC TACTTCACTG CAGCAAATGG AGTGATGAAC 960 
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AAGAAAGCAC AATAAAAATG AGTAACAGAA AAAGCACATA TACTAGCCTA ACAGATTGGC 1020 
TTGTTTTAAA GCAAAGACTG AATTGAAGGT TACATGTTTT AGGATAAACT AATTTCTTTT 1080 
GAGTTCATAA ATCATTTGTA CCCAGAATGT ATTAATATAT TGCTATTAGG TTAATCTGTT 1140 
AACTGAATGC TTTGATCAGC ATTGAGGTGA TGCTCACCTC CGAGGACCTC AGAACTGGTG 1200 
5 CAGCTTCTCT CTCCCTCCCT CCCACAGACT GAACCTTTCG CCAGAAGCTG TCCTTATAAC 1260 
GCCTTATACG CATACACAGC CAGGAAACGT GGAGCATTGT TTCTCACAGA GAGTCTCCAA 1320 
ATAAAAAGGG TTTTGTTCAG ATTAAAATGT TTACAACAAA ATGTTAATTA TATTCTAAAT 1380 
ACAGGGTATG TTCTAATCTA TATTAAGCAA TAATGCCAGT GCATAATCAT TCCATTTGTT 1440 
CCTTTAGCAA TCAACCCCAG AAAATATTAA AATGGGATCA TACACAGAAG ATAGAAAAAT 1500 
1 0 CTAGCAAAAC TTCTCTTTCT GTAAGCCAGA GTCTTGTCTA TCAG ATTCCC AC AACCACTC 1 560 
CTGATTCTAA ATTTAGTGAT ATGGTAATGA AATTGGTATT TATTTTAAAT ATTAGTTATT 1 620 
CTAAGGAGAA AAAAATGCTT CTGCAAGATT TTCATAATTC AGGGGCTGTG GATAGGATTG 1680 
TTCCTCTGTT TCCCTAATCA TTCATCTGTT CATGTCTCCC TCTTGTGCCA GTCAGCCTAG 1740 
GTTATACAGA TGCCATGCTC CACACCACGA GCAGTGTACA AATCTGGCTG CCCGTTTACT 1800 
1 5 TTCTGAGCAA GCACTGGAGT CCACTCCGAC CTTTTTCTTT GAACATGCAT GCTGCTGGAA 1 860 

TATGTATAAA TCAGAACTAG CAGAAGTAGC AGAGTGATGG GAGCAAAATA GGCACTGAAT 1920 
TCGTCAACTC TTTTTTGTGA GCCTACTTGT GAATATTACC TCAGATACCT GTTGTCACTC 1980 
TTCACAGGTT ATTTAAGTTC TTGAAGCTGG GAGGAAAAAG ATGGAGTAGC TTGGAAAGAT 2040 
UT TCCAGCACTG AGCCGTGAGC CGGTCATGAG CCACGATAAA AAATGCCAGT TTGGCAAACT 2100 

£20 CAGCACTCCT GTTCCCTGCT CAGGTATATG CGATCTCTAC TGAGAAGCAA GCACAAAAGT 2 1 60 
j== , AGACCAAAGT ATTAATGAGT ATTTCCTTTC TCCATAAGTG CAGG ACTGTT ACTCACTACT 2220 

"!f ! AAACTCTACC AAGAATGGAA ACCAAGAATA TTTTCTGAAG ATTTTTTTGA AGATTAATTT 2280 

Li] ATACCCTATA AAATAAAACT TGTTAGCTTC GATGAAGTCA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 92 Protein sequence: 
ffj Protein Accession #: BAA91096.1 

1 11 21 31 41 51 

a A i i i i i i 

3 0 MEHLKAFDDE INAFLDNMFG PRDSRVRGWF TLDSYLPTFF LTVMYLLS1W LGNKYMKNRP 60 
S ALSLRGILTL YNLGITLLSA YMLAELILST WEGGYNLQCQ DLTSAGEADI RVAKVLWWYY 120 

f=; FSKSVEFLDT IFFVLRKKTS QITFLHVYHH ASMFNIWWCV LNWIPCGQSF FGPTLNSFVH 180 

ILMYSYYGLS VFPSMHKYLW WKKYLTQAQL VQFVLTITHT MSAVVKPCGF PFGCLIFQSS 240 
YMLTLVILFL NFYVQTYRKK PMKKDMQEPP AGKEVKNGFS KAYFTAANGV MNKKAQ 
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Seq ID NO: 93 DNA sequence 

Nucleic Acid Accession #: NM_O00044 

Coding sequence: 1 1 15-3874 (underlined sequences correspond to start and stop codons) 



fl40 1 11 21 31 41 51 

I I I I I I 

CGAGATCCCG GGGAGCCAGC TTGCTGGGAG AGCGGGACGG TCCGGAGCAA GCCCACAGGC 60 
AGAGGAGGCG ACAGAGGGAA AAAGGGCCGA GCTAGCCGCT CCAGTGCTGT ACAGGAGCCG 120 
AAGGGACGCA CCACGCCAGC CCCAGCCCGG CTCCAGCGAC AGCCAACGCC TCTTGCAGCG 180 

45 CGGCGGCTTC GAAGCCGCCG CCCGGAGCTG CCCTTTCCTC TTCGGTGAAG TTTTTAAAAG 240 
CTGCTAAAGA CTCGGAGGAA GCAAGGAAAG TGCCTGGTAG GACTGACGGC TGCCTTTGTC 300 
CTCCTCCTCT CCACCCCGCC TCCCCCCACC CTGCCTTCCC CCCCTCCCCC GTCTTCTCTC 360 
CCGCAGCTGC CTCAGTCGGC TACTCTCAGC CAACCCCCCT CACCACCCTT CTCCCCACCC 420 
GCCCCCCCGC CCCCGTCGGC CCAGCGCTGC CAGCCCGAGT TTGCAGAGAG GTAACTCCCT 480 

50 TTGGCTGCGA GCGGGCGAGC TAGCTGCACA TTGCAAAGAA GGCTCTTAGG AGCCAGGCGA 540 
CTGGGGAGCG GCTTCAGCAC TGCAGCCACG ACCCGCCTGG TTAGAATTCC GGCGGAGAGA 600 
ACCCTCTGTT TTCCCCCACT CTCTCTCCAC CTCCTCCTGC CTTCCCCACC CCGAGTGCGG 660 
AGCAGAGATC AAAAGATGAA AAGGCAGTCA GGTCTTCAGT AGCCAAAAAA CAAAACAAAC 720 
AAAAACAAAA AAGCCGAAAT AAAAGAAAAA G ATAATAACT CAGTTCTTAT TTGCACCTAC 780 

5 5 TTC AGTGGAC ACTGAATTTG GAAGGTGGAG GATTTTGTTT TTTTCTTTTA AGATCTGGGC 840 
ATCTTTTGAA TCTACCCTTC AAGTATTAAG AGACAGACTG TGAGCCTAGC AGGGCAGATC 900 
TTGTCCACCG TGTGTCTTCT TCTGCACGAG ACTTTGAGGC TGTCAGAGCG CTTTTTGCGT 960 
GGTTGCTCCC G CAAGTTTCC TTCTCTGGAG CTTCCCGCAG GTGGGCAGCT AGCTGCAGCG 1020 
ACTACCGCAT CATCACAGCC TGTTGAACTC TTCTGAGCAA GAGAAGGGGA GGCGGGGTAA 1080 

60 GGGAAGTAGG TGGAAGATTC AGCCAAGCTC AAGG ATG GAA GTGCAGTTAG GGCTGGGAAG 1 140 
GGTCTACCCT CGGCCGCCGT CCAAGACCTA CCGAGGAGCT TTCCAGAATC TGTTCCAGAG 1200 
CGTGCGCGAA GTGATCCAGA ACCCGGGCCC CAGGCACCCA GAGGCCGCGA GCGCAGCACC 1260 
TCCCGGCGCC AGTTTGCTGC TGCTGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA 1320 
GCAGCAGCAG CAGCAGCAGC AGCAGCAAGA GACTAGCCCC AGGCAGCAGC AGCAGCAGCA 1380 

65 GGGTGAGGAT GGTTCTCCCC AAGCCCATCG TAGAGGCCCC ACAGGCTACC TGGTCCTGGA 1440 
TGAGGAACAG CAACCTTCAC AGCCGCAGTC GGCCCTGGAG TGCCACCCCG AGAGAGGTTG 1500 
CGTCCCAGAG CCTGGAGCCG CCGTGGCCGC CAGCAAGGGG CTGCCGCAGC AGCTGCCAGC 1560 
ACCTCCGGAC GAGGATGACT CAGCTGCCCC ATCCACGTTG TCCCTGCTGG GCCCCACTTT 1620 

_ n CCCCGGCTTA AGCAGCTGCT CCGCTGACCT TAAAGACATC CTGAGCGAGG CCAGCACCAT 1680 

70 GCAACTCCTT CAGCAACAGC AGCAGGAAGC AGTATCCGAA GGCAGCAGCA GCGGGAGAGC 1 740 
GAGGGAGGCC TCGGGGGCTC CCACTTCCTC CAAGGACAAT TACTTAGGGG GCACTTCGAC 1800 
CATTTCTGAC AACGCCAAGG AGTTGTGTAA GGCAGTGTCG GTGTCCATGG GCCTGGGTGT 1860 
GGAGGCGTTG GAGCATCTGA GTCCAGGGGA ACAGCTTCGG GGGGATTGCA TGTACGCCCC 1920 
ACTTTTGGG A GTTCCACCCG CTGTGCGTCC CACTCCTTGT GCCCCATTGG CCGAATGCAA 1 980 

7 5 AGGTTCTCTG CTAGACGACA GCGCAGGCAA G AGCACTGAA GATACTGCTG AGTATTCCCC 2040 
TTTCAAGGGA GGTTACACCA AAGGGCTAGA AGGCGAGAGC CTAGGCTGCT CTGGCAGCGC 2100 
TGCAGCAGGG AGCTCCGGGA CACTTGAACT GCCGTCTACC CTGTCTCTCT ACAAGTCCGG 2160 
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agcactggac gaggcagctg cgtaccagag tcgcgactac tacaactttc cactggctct 2220 
ggccggaccg ccgccccctc cgccgcctcc ccatccccac gctcgcatca agctggagaa 2280 
cccgctggac tacggcagcg cctgggcggc tgcggcggcg cagtgccgct atggggacct 2340 
ggcgagcctg catggcgcgg gtgcagcggg acccggttct gggtcaccct cagccgccgc 2400 
5 ttcctcatcc tggcacactc tcttcacagc cgaagaaggc cagttgtatg gaccgtgtgg 2460 
tggtggtggg ggtggtggcg gcggcggcgg cggcggcggc ggcggcggcg gcggcggcgg 2520 
cggcggcggc gaggcgggag ctgtagcccc ctacggctac actcggcccc ctcaggggct 2580 
ggcgggccag gaaagcgact tcaccgcacc tgatgtgtgg taccctggcg gcatggtgag 2640 
cagagtgccc tatcccagtc ccacttgtgt caaaagcgaa atgggcccct ggatggatag 2700 
1 0 ctactccgga ccttacgggg acatgcgttt ggagactgcc agggaccatg ttttgcccat 2760 
tgactattac tttccacccc agaagacctg cctgatctgt ggagatgaag cttctgggtg 2820 
tcactatgga gctctcacat gtggaagctg caaggtcttc ttcaaaagag ccgctgaagg 2880 
gaaacagaag tacctgtgcg ccagcagaaa tgattgcact attgataaat tccgaaggaa 2940 
aaattgtcca tcttgtcgtc ttcggaaatg ttatgaagca gggatgactc tgggagcccg 3000 
1 5 gaagctgaag aaacttggta atctgaaact acaggaggaa ggagaggctt ccagcaccac 3060 
cagccccact gaggagacaa cccagaagct gacagtgtca cacattgaag gctatgaatg 3120 
tcagcccatc tttctgaatg tcctggaagc cattgagcca ggtgtagtgt gtgctggaca 3180 
cgacaacaac cagcccgact cctttgcagc cttgctctct agcctcaatg aactgggaga 3240 
li gagacagctt gtacacgtgg tcaagtgggc c aaggccttg cctggcttcc gcaacttaca 3300 
i20 cgtggacgac cagatggctg tcattcagta ctcctggatg gggctcatgg tgtttgccat 3360 
1=j gggctggcga tccttcacca atgtcaactc caggatgctc tacttcgccc ctgatctggt 3420 
pi tttcaatgag taccgcatgc acaagtcccg gatgtacagc cagtgtgtcc gaatgaggca 3480 
cctctctcaa gagtttggat ggctccaaat caccccccag gaattcctgt gcatgaaagc 3540 
111! actgctactc ttcagcatta ttccagtgga tgggctgaaa aatcaaaaat tctttgatga 3600 
f =25 acttcgaatg aactacatca aggaactcga tcgtatcatt gcatgcaaaa gaaaaaatcc 3660 
=* ! cacatcctgc tcaagacgct tctaccagct caccaagctc ctggactccg tgcagcctat 3720 
ft! tgcgagagag ctgcatcagt tcacttttga cctgctaatc aagtcacaca tggtgagcgt 3780 
t ; ggactttccg gaaatgatgg cagagatcat ctctgtgcaa gtgcccaaga tcctttctgg 3840 
gaaagtcaag cccatctatt tccacaccca g tga agcatt ggaaacccta tttccccacc 3900 
c3 0 ccagctcatg ccccctttca gatgtcttct gcctgttata actctgcact actcctctgc 3960 
agtgccttgg ggaatttcct ctattgatgt acagtctgtc atgaacatgt tcctgaattc 4020 

= TATTTGCTGG GCTTTTTTTT TCTCTTTCTC TCCTTTCTTT TTCTTCTTCC CTCCCTATCT 4080 

f""i AACCCTCCCA TGGCACCTTC AGACTTTGCT TCCCATTGTG GCTCCTATCT GTGTTTTGAA 4140 

ri _ TGGTGTTGTA TGCCTTTAAA TCTGTGATGA TCCTCATATG GCCCAGTGTC AAGTTGTGCT 4200 

r"3 5 TGTTTACAGC ACTACTCTGT GCCAGCCACA CAAACGTTTA CTTATCTTAT GCCACGGGA A 4260 

fp j GTTTAGAGAG CTAAGATTAT CTGGGGAAAT CAAAACAAAA AACAAGCAAA CAAAAAAAAA 4320 

f°*> Seq ID NO: 94 Protein sequence: 

540 Protein Accession #: NP_000035.1 

1 11 21 31 41 51 

I I I I I I 

MEVQLGLGRV YPRPPSKTYR GAFQNLFQSV REVIQNPGPR HPEAASAAPP GASLLLLQQQ 60 
45 QQQQQQQQQQ QQQQQQQQET SPRQQQQQQG EDGSPQAHRR GPTGYLVLDE EQQPSQPQSA 120 
LECHPERGCV PEPGAAVAAS KGLPQQLPAP PDEDDSAAPS TLSLLGPTFP GLSSCSADLK 180 
DILSEASTMQ LLQQQQQEA V SEGSSSGRAR EASGAPTSSK DNYLGGTSTI SDNAKELCKA 240 
VSVSMGLGVE ALEHLSPGEQ LRGDCMYAPL LGVPPAVRPT PCAPLAECKG SLLDDSAGKS 300 
TEDTAEYSPF KGGYTKGLEG ESLGCSGSAA AGSSGTLELP STLSLYKSGA LDEAAAYQSR 360 
5 0 DYYNFPLALA GPPPPPPPPH PHARIKLENP LDYGSAWAAA AAQCRYGDLA SLHGAGAAGP 420 

GSGSPSAAAS SSWHTLFTAE EGQLYGPCGG GGGGGGGGGG GGGGGGGGGG GGEAGAVAPY 480 
GYTRPPQGLA GQESDFTAPD VWYPGGMVSR VPYPSPTCVK SEMGPWMDSY SGPYGDMRLE 540 
TARDHVLPID YYFPPQKTCL ICGDEASGCH YGALTCG SCK VFFKRAAEGK QKYLCASRND 600 
CTIDKFRRKN CPSCRLRKCY EAGMTLGARK LKKLGNLKLQ EEGEASSTTS PTEETTQKLT 660 
5 5 VSHIEGYECQ PIFLNVLEAI EPGWCAGHD NNQPDSFAAL LSSLNELGER QLVHVVKWAK 720 

ALPGFRNLHV DDQMAVIQYS WMGLMVFAMG WRSFTNVNSR MLYFAPDLVF NEYRMHKSRM 780 
YSQCVRMRHL SQEFGWLQIT PQEFLCMKAL LLFSI1PVDG LKNQKFFDEL RMNYKELDR 840 
IIACKRKNPT SCSRRFYQLT KLLDSVQPIA RELHQFTFDL LIKSHMVSVD FPEMMAEIIS 900 
VQVPKILSGK VKPIYFHTQ 
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Seq ID NO: 95 DNA sequence 

Nucleic Acid Accession #: NM 002497 

Coding sequence: 135-1472 (underlined sequences correspond to start and stop codons) 



65 1 11 21 31 41 51 

I I I I I I 

GGCACGAGTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60 
GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGGCAG CGCGACTCTG 120 
GCGACTGGCC GGCC ATGC CT TCCCGGGCTG AGGACTATGA AGTGTTGTAC ACCATTGGCA 180 

70 CAGGCTCCTA CGGCCGCTGC CAGAAGATCC GGAGGAAGAG TGATGGCAAG ATATTAGTTT 240 
GGAAAGAACT TGACTATGGC TCCATGACAG AAGCTGAGAA ACAGATGCTT GTTTCTGAAG 300 
TGAATTTGCT TCGTGAACTG AAACATCCAA ACATCGTTCG TTACTATGAT CGGATTATTG 360 
ACCGGACCAA TACAACACTG TACATTGTAA TGGAATATTG TGAAGGAGGG GATCTGGCTA 420 
GTGTAATTAC AAAGGGAACC AAGGAAAGGC AATACTTAGA TGAAGAGTTT GTTCTTCGAG 480 

75 TGATGACTCA GTTGACTCTG GCCCTGAAGG AATGCCACAG ACGAAGTGAT GGTGGTCATA 540 
CCGTATTGCA TCGGGATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAG CAAAACGTCA 600 
AGCTTGGAGA CTTTGGGCTA GCTAGAATAT TAAACCATGA CACGAGTTTT GCAAAAACAT 660 

388 



TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAGA 720 
AATCAGATAT CTGGTCATTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA ATGCCTCCAT 780 
TTACAGCTTT TAGCCAGAAA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 840 
TTCCATACCG TTACTCTGAT GAATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 900 
ATTACCATCG ACCTTCTGTT GAAGAAATTC TTGAGAACCC TTTAATAGCA GATTTGGTTG 960 
CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 1020 
CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC 1080 
GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1 140 
TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCAGA AAATCTGTTG AAGAACTACA 1200 
GCTTGCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 1260 
CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320 
GGAGTGAGAA TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTGAAGAAAA 1380 
GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAGA TATTGAGAAA AATTACCAAC 1440 
TGAAAAGCAG ACAGATCCTG GGCATGCGC T AG CCAGGTAG AGAGACACAG AGCTGTGTAC 1500 
AGGATGTAAT ATTACCAACC TTTAAAGACT GATATTCAAA TGCTGTAGTG TTGAATACTT 1560 
GGCCCCATGA GCCATGCCTT TCTGTATAGT ACACATGATA TTTCGG AATT GGTTTTACTG 1 620 
TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1680 
ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 
GAACATGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 
TATGCAGGAA GAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGTGGTA TGCTTACAAT 1860 
TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TGTTTTTGAG CTTAGAAAAC 1920 
CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTATAAAT ATTCCATTGC 1980 
TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 
TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTT TCCCACTAAA 2100 
TAAAAGAATT CTTCAGTTA 

Seq ID NO: 96 Protein sequence: 
Protein Accession #: NPJ302488 

1 11 21 31 41 51 

I I I I I I 

GIREFNPNIM ANEVERINMI TOSISGENEA RELATEDKIN ASEHOMOSAP IENSMPSRAE 60 
DYEVLYTIGT GSYGRCQKIR RKSDGKILVW KELDYGSMTE AEKQMLVSEV NLLRELKHPN 120 
IVRYYDRIID RTNTTLYIVM EYCEGGDLAS VITKGTKERQ YLDEEFVLRV MTQLTLALKE 180 
CHRRSDGGHT VLHRDLKPAN VFLDGKQNVK LGDFGLARIL NHDTSFAKTF VGTPYYMSPE 240 
QMNRMSYNEK SDIWSLGCLL YELCALMPPF TAFSQKELAG KIREGKFRRI PYRYSDELNE 300 
IITRMLNLKD YHRPSVEEIL ENPLIADLVA DEQRRNLERR GRQLGEPEKS QDSSPVLSEL 360 
KLKEIQLQER ERALKAREER LEQKEQELCV RERLAEDKLA RAENLLKNYS LLKERKFLSL 420 
ASNPELLNLP SSVIKKKVHF SGESKENIMR SENSESQLTS KSKCKDLKKR LHAAQLRAQA 480 
LSDIEKNYQL KSRQILGMR 

Seq ID NO: 97 DNA sequence 

Nucleic Acid Accession #: NM 007050.2 

Coding sequence: 1 85-4576 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CCTCCCGCCT CAGTTCGCGC CGCGCCTCGG CTTGGAACGC AGGAGCGCCG GCTCCGGGAG 60 
CCCGAGCGGA GCCAGCCGCG CGCACAGCCA GCGGCCGCGC CGGCGATGCG GGGCCACCCC 120 
GCGCCCGCCC CAGTCCCGGC CCCGGCCCCC GCGGG AAGGG GCTGAGCTGC CCGCCGCCGC 1 80 
CCGG ATG GCG AGCCTCGCCG CGCTCGCCCT CAGCCTGCTC CTGAGGCTGC AGCTGCCGCC 240 
ACTGCCCGGC GCCCGGGCTC AGAGCGCCCC AGGTGGCTGT TCCTTTGATG AGCACTACAG 300 
CAACTGTGGT TATAGTGTGG CTCTAGGGAC CAATGGGTTC ACCTGGGAGC AGATTAACAC 360 
AACGGAGAAA CCAATGCTGG ACCAGGCAGT GCCCACAGGA TCTTTCATGA TGGTGAACAG 420 
CTCTGGGAGA GCCTCTGGCC AGAAGGCCCA CCTTCTCCTG CCAACCCTGA AGGAGAATGA 480 
CACCCACTGC ATCGACTTCC ATTACTACTT CTCCAGCCGT GACAGGTCCA GCCCAGGGGC 540 
CTTGAACGTC TACGTGAAGG TGAATGGTGG CCCCCAAGGG AACCCTGTGT GGAATGTGTC 600 
CGGGGTCGTC ACTGAGGGCT GGGTGAAGGC AGAGCTCGCC ATCAGCACTT TCTGGCCACA 660 
TTTCTATCAG GTGATATTTG AATCCGTCTC ATTGAAGGGT CATCCTGGCT ACATCGCCGT 720 
GGACGAGGTC CGGGTCCTTG CTCATCCATG CAGAAAAGCA CCTCATTTTC TGCGACTCCA 780 
AAACGTGGAG GTGAATGTGG GGCAGAATGC CACATTTCAG TGCATTGCTG GTGGGAAGTG 840 
GTCTCAGCAT GACAAGCTTT GGCTCCAGCA ATGGAATGGC AGGGACACGG CCCTGATGGT 900 
CACCCGTGTG GTCAACCACA GGCGCTTCTC AGCCACAGTC AGTGTGGCAG ACACTGCCCA 960 
GCGGAGCGTC AGCAAGTACC GCTGTGTGAT CCGCTCTGAT GGTGGGTCTG GTGTGTCCAA 1020 
CTACGCGGAG CTGATCGTGA AAGAGCCTCC CACGCCCATT GCTCCCCCAG AGCTGCTGGC 1080 
TGTGGGGGCC ACATACCTGT GGATCAAGCC AAATGCCAAC TCCATCATCG GGGATGGCCC 1 140 
CATCATCCTG AAGGAAGTGG AATATCGCAC CACCACAGGC ACGTGGGCAG AGACCCACAT 1200 
AGTCGACTCT CCCAACTATA AGCTGTGGCA TCTGGACCCC GATGTTGAGT ATGAGATCCG 1260 
AGTGCTCCTC ACACGACCAG GTGAGGGGGG TACGGGACCG CCAGGGGCTC CCCTCACCAC 1320 
CAGGACCAAG TGTGCAGATC CGGTACATGG CCCACAGAAC GTGGAAATCG TAGACATCAG 1380 
AGCCCGGCAG CTGACCCTGC AGTGGGAGCC CTTCGGCTAC GCGGTGACCC GCTGCCATAG 1440 
CTACAACCTC ACCGTGCAGT ACCAGTATGT GTTCAACCAG CAGCAGTACG AGGCCGAGGA 1500 
GGTCATCCAG ACCTCCTCCC ACTACACCCT GCGAGGCCTG CGCCCCTTCA TGACCATCCG 1560 
GCTGCGACTC TTGCTGTCTA ACCCCGAGGG CCGAATGGAG AGCGAGGAGC TGGTGGTGCA 1620 
GACTGAGGAA GACGTTCCAG GAGCTGTTCC TCTAGAATCC ATCCAAGGGG GGCCCTTTGA 1680 
GGAGAAGATC TACATCCAGT GGAAACCTCC CAATGAGACC AATGGGGTCA TCACGCTCTA 1740 
CGAGATCAAC TACAAGGCTG TCGGCTCGCT GGACCCAAGT GCTGACCTCT CGAGCCAGAG 1800 
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GGGGAAAGTG TTCAAGCTCC GGAATGAAAC CCACCACCTC TTTGTGGGTC TGTACCCAGG 1860 
GACCACCTAT TCCTTCACCA TCAAGGCCAG CACAGCAAAG GGCTTTGGGC CCCCTGTCAC 1920 
CACTCGGATT GCCACCAAAA TTTCAGCTCC ATCCATGCCT GAGTACGACA CAGACACCCC 1980 
ATTGAATGAG ACAGACACGA CCATCACAGT GATGCTGAAA CCCGCTCAGT CCCGGGGAGC 2040 
TCCTGTCAGT GTTTATCAGC TGGTTGTCAA GGAGGAGCGA CTTCAGAAGT CACGGAGGGC 2100 
AGCTGACATT ATTGAGTGCT TTTCGGTGCC CGTGAGCTAT CGGAATGCCT CCAGCCTCGA 2160 
TTCTCTACAC TACTTTGCTG CTGAGTTGAA GCCTGCCAAC CTGCCTGTCA CCCAGCCATT 2220 
TACAGTGGGT GACAATAAGA CATACAATGG CTACTGGAAC CCTCCTCTCT CTCCCCTGAA 2280 
AAGCTACAGC ATCTACTTCC AGGCACTCAG CAAAGCCAAT GGAGAGACCA AAATCAACTG 2340 
TGTTCGTCTG GC TACAAAAG CACCAATGGG CAGCGCCCAG GTGACCCCGG GGACTCCACT 2400 
CTGCCTCCTC ACCACAGGTG CCTCCACCCA GAATTCTAAC ACTGTGGAGC CAGAGAAGCA 2460 
GGTGGACAAC ACCGTGAAGA TGGCTGGCGT GATCGCTGGC CTCCTCATGT TCATCATCAT 2520 
TCTCCTGGGC GTGATGCTCA CCATCAAAAG GAGAAGAAAT GCTTATTCCT ACTCCTATTA 2580 
CTTGTCCCAA AGGAAGCTGG CCAAGAAGCA GAAGGAGACC CAGAGTGGAG CCCAGAGGGA 2640 
GATGGGGCCT GTGGCCTCTG CCGACAAACC CACCACCAAG CTCAGCGCCA GCCGCAATGA 2700 
TGAAGGCTTC TCTTCTAGTT CTCAGGACGT CAACGGATTC ACAGATGGCA GCCGCGGGGA 2760 
GCTTTCCCAG CCCACCCTCA CGATCCAGAC TCATCCCTAC CGCACCTGTG ACCCTGTGGA 2820 
GATGAGCTAC CCCCGGGACC AGTTCCAACT CGCCATCCGG GTGGCTGACT TGCTGCAGCA 2880 
CATCACGCAG ATGAAGAGAG GCCAGGGCTA CGGGTTCAAG GAGGAATACG AGGCCTTACC 2940 
AGAGGGGCAG ACAGCTTCGT GGGACACAGC CAAGGAGGAT GAAAACCGCA ATAAGAATCG 3000 
ATATGGGAAC ATCATATCCT ACGACCATTC CCGGGTGAGG CTGCTGGTGC TGGATGGAGA 3060 
CCCGCACTCT GACTACATCA ATGCCAACTA CATTGACGGA TACCATCGAC CTCGGCACTA 3120 
CATTGCGACT CAAGGTCCGA TGCAGGAGAC TGTAAAGGAC TTTTGGAGAA TGATCTGGCA 3180 
GGAGAACTCC GCCAGCATCG TCATGGTCAC AAACCTGGTG GAAGTGGGCA GGGTGAAATG 3240 
TGTGCGATAC TGGCCAGATG ACACGGAGGT CTACGGAGAC ATTAAAGTCA CCCTGATTGA 3300 
AACAGAGCCC CTGGCAGAAT ACGTCATACG CACCTTCACA GTCCAGAAGA AAGGCTACCA 3360 
TGAGATCCGG GAGCTCCGCC TCTTCCACTT CACCAGCTGG CCTGACCACG GCGTTCCCTG 3420 
CTATGCCACT GGCCTTCTGG GCTTCGTCCG CCAGGTCAAG TTCCTCAACC CCCCGGAAGC 3480 
TGGGCCCATA GTGGTCCACT GCAGTGCTGG GGCTGGGCGG ACTGGCTGCT TCATTGCCAT 3540 
TGACACCATG CTTGACATGG CCGAGAATGA AGGGGTGGTG GACATCTTCA ACTGCGTGCG 3600 
TGAGCTCCGG GCCCAAAGGG TCAACCTGGT ACAGACAGAG GAGCAATATG TGTTTGTGCA 3660 
CGATGCCATC CTGGAAGCGT GCCTCTGTGG CAACACTGCC ATCCCTGTGT GTGAGTTCCG 3720 
TTCTCTCTAC TACAATATCA GCAGGCTGGA CCCCCAGACA AACTCCAGCC AAATCAAAGA 3780 
TGAATTTCAG ACCCTCAACA TTGTGACACC CCGTGTGCGG CCCGAGGACT GCAGCATTGG 3840 
GCTCCTGCCC CGGAACCATG ATAAGAATCG AAGTATGGAC GTGCTGCCTC TGGACCGCTG 3900 
CCTGCCCTTC CTTATCTCAG TGGACGGAGA ATCCAGCAAT TACATCAACG CAGCACTGAT 3960 
GGATAGCCAC AAGCAGCCTG CCGCCTTCGT GGTCACCCAG CACCCTCTAC CCAACACCGT 4020 
GGCAGACTTC TGGAGGCTGG TGTTCGATTA CAACTGCTCC TCTGTGGTGA TGCTGAATGA 4080 
GATGGACACT GCCCAGTTCT GTATGCAGTA CTGGCCTGAG AAGACCTCCG GGTGCTATGG 4140 
GCCCATCCAG GTGGAGTTCG TCTCCG CAG A CATCGACGAG GACATCATCC ACAGAATATT 4200 
CCGCATCTGT AACATGGCCC GGCCAC AG G A TGGTTATCGT ATAGTCCAGC ACCTCCAGTA 4260 
CATTGGCTGG CCTGCCTACC GGGACACGCC CCCCTCCAAG CGCTCTCTGC TCAAAGTGGT 4320 
CCGACGACTG GAGAAGTGGC AGGAGCAGTA TGACGGGAGG GAGGGACGTA CTGTGGTCCA 4380 
CTGCCTAAAT GGGGGAGGCC GTAGTGGAAC CTTCTGTGCC ATCTGCAGTG TGTGTGAGAT 4440 
GATCCAGCAG CAAAACATCA TTGACGTGTT CCACATCGTG AAAACACTGC GTAACAACAA 4500 
ATCCAACATG GTGGAGACCC TGGAACAGTA TAAATTTGTA TACGAGGTGG CACTGGAATA 4560 
TTTAAGCTCC TTT TAG CTCA ATGGGATGGG GAACTGCCGG AGTCCAGAGG CTGCTGTGAC 4620 
CAAGCCCCCT TTTGTGTGAA TGGCAGTAAC TGGGCTCAGG AGCTCTGAGG TGGCACCCTG 4680 
CCTGACTCCA AGGAGAAGAC TGGTGGCCCT GTGTTCCACG GGGGGCTCTG CACCTTCTGA 4740 
GGGGTCTCCT GTTGCCGTGG GAGATGCTGC TCCAAAAGGC CCAGGCTTCC TTTTCAACCT 4800 
AACCAGCCAC AGCCAAGGGC CCAAGCAGAA GTACACCCAC AAGCAAGGCC TTGGATTTCT 4860 
GGCTCCCAGA CCACCTGCTT TTGTTCTGAG TTTGTGGATC TCTTGG CAAG CCAACTGTGC 4920 
AGGTGCTGGG GAGTGGGAGG CTCCCCTGCC CTCCTTCTCC TTAGGAGTGG AGGAGATGTG 4980 
TGTTCTGCTC CTCTACGTCA TGGAAAAGAT TGAGGCTCTT GGGGGTCACT GCTCTGCTGC 5040 
CCCCTGCAAC CTCCTTCAGG GGCCTCTGGC ACCAGACATT TGCAGTCTGG ACCAGTGTGA 5100 
CCTTACGATG TTCCCTAGGC CACAAGAGAG GCCCCCCATC CTCACACCTA ACCTGCATGG 5160 
GGCTTCGCCC ACAACCATTC TGTACCCCTT CCCCAGCCTG GGCCTTGACC GTCCAGCATT 5220 
CACTGGCCGG CCAGCTGTGT CCACAGCAGT TTTTGATAAA GGTGTTCTTT GCTTTTTTGT 5280 
GTGGTCAGTG GGAGGGGGTG GAACTGCAGG GAACTTCTCT GCTCCTCCTT GTCTTTGTAA 5340 
AAAGGGACCA CCTCCCTGGG GCAGGGCTTG GGCTGACCTG TAGGATGTAA CCCCTGTGTT 5400 
TCTTTGGTGG TAGCTTTCTT TGGAAGAGAC AAACAAGATA AGATTTGATT ATTTTCCAAA 5460 
GTGTATGTGA AAAGAAACTT TCTTTTGGAG GGTGTAAAAT CTTAGTCTCT TATGTCAAAA 5520 
AGAAGGGGGC GGGGGAGTTT GAGTATGTAC CTCTAAGACA AATCTCTCGG GCCTTTTATT 5580 
TTTTCCTGGC AATGTCCTTA AAAGCTCCCA CCCTGGGACA GCATGCCACT GAGCAAGGAG 5640 
AGATGGGTGA GCCTGAAGAT GGTCCCTTTG GTTTCTGGGG CAAATAGAGC ACCAGCTTTG 5700 
TGCATAATTT GGATGTCCAA ATTTGAACTC CTTCCTAAAG AAACCCAGCA GCCACCTTGA 5760 
AAAAGGCCAT TGTGGAGCCC ATTATACTTT GATTTAAAAT AGGCCAAGAG AATCAGGCCT 5820 
GGAGATCTAG GGTCTTGTCC AAAGTGTGAG TGAGTCAATG AGAGGGAACC AACATTTGCT 5880 
AAGTCTCTAC TGTATGCCAG GGATCATGCT TGGCACTTTC CATAGGACAT TTCACACAGT 5940 
CCTTAGAACC CCCAGGAGAG AGCTACTGAC TTGTTATCAT CTCCATTTGA TCATCTCCTC 6000 
CAATGAGGAA ACCCACGCAC CTTCCTTAGT AATGAAATCC TGGGTTCCAA AGGGGCAGGT 6060 
AATGGCAATG AGACTTCTCC GTGCTGTTTT CTTCATCTTC TCTAAGCCAA GCAATTATTT 6120 
TATGGAGGGA AAATAAGGCC AGAAACTTCT GAGCAGATAA CTCCACAAAT GGAAATTTAG 6180 
TACTTTCTTC CTGATGCCAG TTCTTCTGGG AAGCGCAGAA TTTCAGATAT ATTTTAGTAA 6240 
CACATTCCCA GCTCCCCAGG AAAGCCAGTC TCATCTAATT TCTTAGTCAG TAAAAACAAT 6300 
TCCCTGTTCC TTCAGGCTAT GAATGGACCA GCCAGGGAAA CTCTCGACCT TGATCTCTAG 6360 
CCAGTGCTTA GGCCCAATAT CTGACAGCCT CAGGTGGGCT GGGACCTAGG AAGCTCCATC 6420 
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TTGAAGGCTG GTCTAGCCCC AGACAGGGCA TGAGGGGCAG AGAATTCAAG AAGGTACAGC 6480 
TTTGGCCCTC AAGAGCCCAC TGTATGCTGG GGAAATGGAA CCATGGTGCA GTAGTGTGGA 6540 
GTGGATGAGT GTTCCATGAG CCTAGGAGCA AGAAAGTCTC TTCGGCCTCG GGCTTCCTGG 6600 
AGAAGGGGAC GTCCATTCCT GCTGGGTCTT AACAAGCATA AAAAGGAAAA AAAGGAAACT 6660 
5 CAGGCAAAGG GATCCATATG TGCAATGGCA AAGAAATGTG AAAAGGCATT GGGAGAAGCA 6720 
GTCTGGGGGA GGCCAGCCCA GTGCGGGCAC AGCACAACAC GGGGAGCAGC AAGAGATGAG 6780 
CCAGGGTCCA GGAGACAGAT GCCCATCGCG AGTACAGACT TTGTCCTATT GGCAACAAGG 6840 
AGTCCATGGA GCTTTAGAGA GATGCACTCA GCTTCGTGTT GGCCAAGACT CCTTCTGGGC 6900 
CAATGGGGCT GCCTCTTTTC CTTTCATCAG ACACTGTGAA AACATTCCCT TAAGCGTGCA 6960 
1 0 CTTTTTAATA TCACATCTAT TTGTCTGTCT GCTCATTGTT TTGTTGCTGG AACTAAATAT 7020 

GCAATGGATC ATG AG ACTC A GATTCTATGA GAAACCCAGG GTCTCTGCTT TACCACGGAG 7080 
CAGGGTCACC AACCCAGATC TCCAGGCCCA TGAGGATGGA ACATGAAAGG AGCCGACAAA 7140 
AGTTGCTTCC ATTGGCATGG GCTCTGGAGC TGTCCAGAAG TCCAGGGACA CCAGACTTGA 7200 
TCAAGGAAGG GCTGTCACTT TAGAGGTTCA AAAGGAAGTG CCTCAAAGCA AAGGCAAGCA 7260 
1 5 AAGGAACCCC ACGATGAACT TGCTCTTTTC CTTTGATGAG CCTCTCCCCA GGTGTATTTC 7320 
AGCAGACCCC GGGGACCCAC CCCCACTGGG CCTGCTGGCC TCCCTCGGCT CCAGCCCAAT 7380 
GCCCCAGCTG GCCTTCCCCA GCCTGCAAGG AGCCTGTAGC ATGGCAAATC TGCCTGCTGT 7440 
ATGCTATTTT CTTAG ATCTT GGTACATCCA GACAGGATGA GGGTGGAGGG AGAGCTATTT 7500 
M- AACACAAATC CTAAGATTTT TTTCTGCTCA GGAAGGGGTG AAATAGCTGG CAGATACAAA 7560 

1*20 AGACAGTGGC TTTTATCATT TTAAATGGTA GGAATTTAAG GTGTGACTTC AGGGAGAAAC 7620 
M AAACTTGCAA AAAAAAAAAA TCTCAGGCCA TGTTGGGGTA ACCCAGCAAG GGCCAGTGAT 7680 

f** GATTTCCCCC AGCTCATCCC CTTATTTTCC CACAACCCAA CCATTCTCTA AAGCAGGACA 7740 

S: GTGAATAGGT CTTAGGCCAG TGCACACAGG AAGAAATTGA GGCTTATGGA TGGGGATGAC 7800 

yD TTCCCTAAGA TCCCATGGGA CAAGGATGTG GCAAGGCTTG GATGAGATGG GGCACCAGTG 7860 

r==25 CCCAGGAATT TGAACATTTT CCTTTACCCA GGAAATCTCC GGAGCCAACA CCACCACCCC 7920 
**! CAGGGGGTCT CCCCACCCCA CCCCATTTAC AGGGTGAGCT CAGCCTG TCA TGAGCAGAGG 7980 

Fy AAAATATTAT TAATGCTCTC TGAGTCTTTA CAACAGGAGC TCTTACCTCA TAGATGTGGG 8040 

-„ ' i CTCTGTTTGG GGAAGATGCA AGGAAGTAAT GAGAAGCCCA GGAAATTTCT CCACCTGTGT 8100 

"y TTATGGCCTA AATAGCTTCA GGATGTATCT TAGCTGCACT CCAACATTGC ATCCTTTCTG 8160 

O 0 GGGTGAAGAA TCTGGGCCAA CCAGGGGTCC TTGGGCCTCT AGAAGGCCAC AGTAGGCCTC 8220 

TCTTTGTGGG AATGGAAGGG GACAGTTTGC TTTTAGTGCT GGCCCTCTCT GTGGGTGTGG 8280 
= CCTGCAAAGG AACCAACAGA CCCTATGCTG GGGACTCTAA CATGTGAGCT CATTAAATTC 8340 

r"! TTCCAGCATT CTAAAGGAGG GTTTGTGATT GTCACCATTT ACTGATGAGG AAACTAAGGC 8400 

TCCTAGGGGA GAAATCACTT GCCCACAGTT CCACAGCTAG TGAGTGAATG AACCAGGATT 8460 
H3 5 TAAACCGGTT TTTTCTCACT ACAGAGACAA TATTTTTCCA CCATTGTATC TCACATTTTT 8520 
F! I CCCAGGAGGT TACCCATAAC AGAAGAGACT AGAGTGGAAC AGATACGTCA GTGGATAAAG 8580 

~ CTCAAAGCAA ACAACAGTAA GCTTAAAATT CCTTCATAGT CTCATGTTTT ACGTTCACAA 8640 

=#=■ TTCATGCAAA ATTTGCATTC CACTTTCTGA TTTAGCCTTG TTGGTTTTAA TATGACTCTA 8700 

f =■ TGAATATTTC AAAAAAAAAT GTGCTCTGTT CCTCATGTTG TTCTGTTCTG TTCACCCCGC 8760 

'"40 TATGACGGAC CCTAGGTCAG CTGGTCTTCA GCTTGACCCT AGAATTGACT CTAGGAGCAG 8820 
TU TGACCCTGCT GCCTCCCAGA GCCAGTTATA GGCTCAAGAT CAAGACCAAC TGACCTTCTC 8880 

CTAGGCAGCT CCTTTGGTGT GTGGGTGCTC TGACCTCACT GTTCATGAGG GGACCTCAAC 8940 
TAAGGCATCT TCCAGTTGGG TGCTGGAAGG AACCCATTAA CTCACACTAG AATGATGAGG 9000 
ATTTGCTCAT CTGGCGTGGA GAAGGATGAG CCCACAAAAC CCTAAAGGGA AAAGAGAAGC 9060 
45 TGGACACAGC TGTACTCAGC AGATTCCTGA ATGCTAGGCT GGAAAGTGGT GCCTGTTGTC 9120 
CAAGTGGAGT CACATGGTTG CTAATGTGGG CAAGTCTGAG GACACACTTC ATGAGCAGCT 9180 
GGGGTCTGGA AGGCTCCTCA CTTTACCCTA GCCACACATA ATTACTGGGT GCCTACAGCA 9240 
CCTAGCACCT TGGAGGGGGC ACTATTAGGA AATCGAGATT ACTATGGCAC AATTAATTCC 9300 
TGGGTAAGGC ATGGGGTTGT GGTGGACAGA GCTCAGTCTT TAGTTTGAAC GAAAACATAC 9360 
50 ATACATGAAA AACATACATG AAAAAAGGAC CCTCATCAAC ATTAGAAGGG GTAGATTTGG 9420 
AGCACTTTAG GCAGGAAAAC AGGAACGCAA GGCCAGGAAA CTGGAACCCA GTGAATACTC 9480 
AGAACCGAGG ATGCAGATGA CTTATTTAGC AAAATGGTCA CTTCTGTGAC ATAGCTGGAG 9540 
AAAGGATGGG TAACAGCTTG CCAGAGCCAC TTGGAACAAG GGCAAATCTC AGTGTCTGGG 9600 
GCAAAAGATG ATGCATTTCC CTCTGACCCA TCATGTTTAT TCATCCTCCA CTCCCCATTG 9660 
5 5 CCACACTAGC TCTTGCTGTA AGTCCTCACC AGGATCTACA TTTCCTCGTC GCTGGTGGGA 9720 
ACCCCTTAGA GTACATAGAG GTATCAGTCC AGTAAGACTG CTCTACACAA CAGAAGTGAG 9780 
GCCCAGGGAG TAGCAGCCAG GCCCTTATCC TGTTACCTCT GCAGGAGTGA CTGCCCAACC 9840 
CAGATCCAGA GACATTGAAG GAAATGATAA TTCCTTGGTA CCTCACTGCC TTGGGACAAA 9900 
ATGAAGAAAG CCACCCTTCC TTAGGCTGCA GCTTGCCACT CCTGGGCTGG GTAAACAGGT 9960 
60 CATCAGCACC AGGCTCAACC AGGAGTAACA TTCTGGAAGA CATGGGTGAG CCCAAGAGGA 10020 
AGCATGAACA GGACGCTGTT CCTAAGTCAT GTCAACAGGT TGTGCTGGGC CAGGATCCCC 10080 
AGGGAAAAAA ATGGTCAACC CAACTGGAGG GTAGGTTAGA AGAAAAAAAA CATAAACGTG 10140 
GATAGTCATG TCATCTCAAA TCCCTGACTT GGCTTCCCCA TTACTTGACA GTCTGAGCTC 10200 
CTTCTTAGCC TGTGACCAGC TTCAAATCAC AGCCAAGTAA AACAAGGAAA TAGGAAAAGT 10260 
65 AAATCCAACT AGAAGAGACA AGCTGAGATT CAGATTTGTT TACTCCTCCC ATGCAAAGTT 10320 
TCCCTGTTGG AGGTTTTCCA TGTATACATG TCTAGAAGTG ATAGAATGCA AGGCCTTGGC 10380 
TTTGTCTTGC AGGGATCTGC CTTTGAGGTC ATAGACTGAA CAGCAGGGAG AGAGGTTAGT 10440 
GGTGG AGTGT GGGGGGAGCT GTTCTAGCTC CAGTTTCTTC TGACACATTT TTCAGGATCA 10500 
TGGATCTGAT CCTCCGAAGC ACAGCAGAGA TATCTAAGCC ATATTTGTGC ACATGAGCAG 10560 
70 ACTCTTCTAG TTTTTTAGTA ACCAGGGATG GGCTTTTGCA TGGCACTGAC TATAGAGATG 1 0620 
TCTTGTAGAG ATCAAGCCAG TCTTTTGCAT CCCACCTGCC CACCTCCAGA AGAGATGGGA 10680 
AAAGGTCATC AAAGGGCATT CACCAACTGA AATCCACTCA TGAATGTTAG GTCTCTAAAA 10740 
GGAGGCATCA ACACTCACAA TGGTAGCCTC CAAACCTAGC ATCCCACCTA TCTAAGAGCT 10800 
CAGGGGTGGT CCACTGGGGC AGATACAAGG GAAGTGCAAG GGCTCAGGAT GAAAGAAAAT 10860 
75 CTATTGGGAA GAGTTTTAGG GGCTTGATCA TTATGGGGCT TCCTTCTATA TCTGAGAACT 10920 
GCTCTGGGTG GTGAGATGTG GACTCTGATC CTTAATTGGA ATGTTCGGAG AATGAGTGTC 10980 
TGGTGGCCTT GAAGTGTTGG ACAGAAAAGT ATCAGTATAA AAGCCTGGAG CTCAGGGTAA 1 1040 



TTAATGTAGT TCATGGTTCC TTAGTGAGCA GGACTCTTGG ATGTGGAGGA GAAAGGGTCA 1 1 1 00 
TAGGAAGTAA ACCACCAAAA TTACAAAATT GAGTCTCTGT ACAATTACTT CAGTGCCTTT 11160 
GGGCTTATGA ATACAAATCA GTGGGCCTTC TCTATG ATGG TCCAACAAAC TCTCAGTGTC 11220 
CACCCTGTCC CTGTATCTCC CATGGAAGAT GAATAATGTC AGGTGTTCTT TGGGTCAAAG 1 1280 
GCCCCAGGGC AGTCTGGAGG CTTAGAGGGC AGAGTGGTGT CATTCCATGT AAAGTTAGGC 1 1340 
TTCTGAGGGG TCAGGCAGAA TATGGTGTCC ATATCTTCCA TAGCTCTGCA GATTCTTGGA 1 1400 
TGAAGTCAAG CACAGTTTGC TAGACCCAGG TCACTCCTCT GAGTATAACT AGGACCCATG 1 1460 
AGTGAAACTT AATAGCTGTA AGGAAGAACC TGCTGTCTGC CAGAGAGGAT AAGCTGCCCA 11520 
TCTCAGCAGC TGTCTAAAAG AAGGCAGGTG TCTCTTTAAA GGGAAGAGAA GCATTGGTGA 11580 
AATGGATTTC AGGTCACTTC CATTCCAGAT GGGTGAGATC TTGTGGAGCT GGGATCATGT 11640 
TTGAACTCAT TCATACCTGT AGAGCACGAA TCCAAGTAG A TTGTGTTTGG TCTGTACAGG 1 1 700 
CTGAAGCCCC CTGCTCTCCC ACCCAAGTGC CCCCACTGAG CAGGCCAACA TGCTGTTGTG 11760 
GCCACATATA CTGGGCTGAT CCAGGCTGGT TATCACCAAA CAGCAAACCA TAGGGAACAG 1 1820 
CTGCTTTGCC ATAGACCCAA TACCCATGTA GATCTCTCAT GAGAGCAGCC ATAACTCAGA 11880 
CCCACTGACC AACAGGGCCA TGAGTGACAG CCAGAACCAG TGAAGGTCCA AGTAGGACAC 11940 
AGAGCAGGGC TTTTCTTACC ATACACATTA TCTCCAGAGG TTATTTCTAC CCCACTCCCT 12000 
ATTCAAGGCC TGTTGGAGCA CACTGCAAAA GCAAAAGCAC AGTAACTCAA TTTACACATG 12060 
ATTATAATCA TTTCCAGTGC ACACATTTCA TCACCAGGTG GATCCTGAGC TAGCCCATGT 12120 
AAATCCGGGT TAACCCATAT TGGTAATCAT ACTCAAAAGC ACTTTTCACC CTACATTCTA 12180 
CTAGCCAATC AAAGACAAAG AGTTGTGGCC TCTACCATTG CCTTGGCTTC TGGACACCCT 12240 
CACAAGCTAT CCCAAGGTTC CCGCTCAACT CCAGGGAGGC TGACATCTTC ACATCCACTG 12300 
GGCATATAAT ATTGCATGAG ACCAAAGTCT CCACACTCTT TGCAGCCTCC TCCATGAATC 12360 
CCAATGGCCT GCACTTGTAC AGTTTGGGTG TTTGATAGAT AAAGCACGTA TGAGAAGAGA 12420 
AAACAAAATA AATCAACTTT TTAAAAAAGC CAGCACTGTG CTGTCAATGT TTTTT TTTTC 12480 
TTTTCAATTC TAGCTCAGAA AAGCAGAAGG TAAATAATGT CAGGTCAATG AATATCAGAT 12540 
ATATTTTTTG ACTGTACATT ACAGTGAAGT GTAATCTTTT TACACCTGCA AGTCCATCTT 12600 
ATTTATTCTT GTAAATGTTC CCTGACAATG TTTGTAATAT GGCTGTGTTA AAAAATCTAT 12660 
ACAATAAAGC TGTGACCCTG 

Seq ID NO: 98 Protein sequence: 
Protein Accession #: NPJ50898 1 . 1 

1 11 21 31 41 51 

I I I I I I 

GIREFNPPRO TEINTYROSI NEPHOSPHAT ASERECEPTO RTYPETHOMO SAPIENSMAS 60 
LAALALSLLL RLQLPPLPGA RAQSAPGGCS FDEHYSNCGY SVALGTNGFT WEQINTTEKP 120 
MLDQAVPTGS FMMVNSSGRA SGQKAHLLLP TLKENDTHCI DFHYYFSSRD RSSPGALNVY 1 80 
VKVNGGPQGN PVWNVSGVVT EGWVKAELAI STFWPHFYQV IFESVSLKGH PGYIAVDEVR 240 
VLAHPCRKAP HFLRLQNVEV NVGQNATFQC IAGGKWSQHD KLWLQQWNGR DTALMVTRW 300 
NHRRFSATVS VADTAQRSVS KYRCVIRSDG GSGVSNYAEL IVKEPPTPIA PPELLAVGAT 360 
YLWIKPNANS IIGDGPIILK EVEYRTTTGT WAETHIVDSP NYKLWHLDPD VEYEERVLLT 420 
RPGEGGTGPP GAPLTTRTKC ADPVHGPQNV EIVDIRARQL TLQWEPFGYA VTRCHSYNLT 480 
VQYQYVFNQQ QYEAEEVIQT SSHYTLRGLR PFMTIRLRLL LSNPEGRMES EELVVQTEED 540 
VPGAVPLESI QGGPFEEKIY IQWKPPNETN GVITLYEINY KAVGSLDPSA DLSSQRGKVF 600 
KXRNETHHLF VGLYPGTTYS FTKASTAK.G FGPPVTTRIA TKISAPSMPE YDTDTPLNET 660 
DTTITVMLKP AQSRGAPVSV YQLVVKEERL QKSRRAADII ECFSVPVSYR NASSLDSLHY 720 
FAAELKPANL PVTQPFTVGD NKTYNGYWNP PLSPLKSYSI YFQALSKANG ETKINCVRLA 780 
TKAPMGSAQV TPGTPLCLLT TGASTQNSNT VEPEKQVDNT VKMAGVIAGL LMFIIILLGV 840 
MLTKRRRNA YSYSYYLSQR KLAKKQKETQ SGAQREMGPV ASADKPTTKL SASRNDEGFS 900 
SSSQDVNGFT DGSRGELSQP TLTIQTHPYR TCDPVEMSYP RDQFQLAIRV ADLLQHTTQM 960 
KRGQGYGFKE EYEALPEGQT ASWDTAKEDE NRNKNRYGNI ISYDHSRVRL LVLDGDPHSD 1020 
YINANYIDGY HRPRHYIATQ GPMQETVKDF WRMIWQENSA SIVMVTNLVE VGRVKCVRYW 1080 
PDDTEVYGDI KVTLIETEPL AEYVIRTFTV QKKGYHEIRE LRLFHFTSWP DHGVPCYATG 1 140 
LLGFVRQVKF LNPPEAGPIV VHCSAGAGRT GCFIAIDTML DMAENEGVVD IFNCVRELRA 1200 
QRVNLVQTEE QYVFVHDAIL EACLCGNTAI PVCEFRSLYY NISRLDPQTN SSQIKDEFQT 1260 
LNIVTPRVRP EDCSIGLLPR NHDKNRSMDV LPLDRCLPFL ISVDGESSNY INAALMDSHK 1320 
QPAAFVVTQH PLPNTVADFW RLVFDYNCSS VVMLNEMDTA QFCMQYWPEK TSGCYGPIQV 1380 
EFVSADIDED IIHRIFRICN MARPQDGYRI VQHLQYIGWP AYRDTPPSKR SLLKWRRLE 1440 
KWQEQYDGRE GRTVVHCLNG GGRSGTFCAI CSVCEMIQQQ NIIDVFfflVK TLRNNKSNMV 1500 
ETLEQYKFVY EVALEYLSSF 

Seq ID NO: 99 DNA sequence . 

Nucleic Acid Accession #; NM_002988. 1 

Coding sequence: 71-340 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CCGGCACGAG AGGAGTTGTG AGTTTCCAAG CCCCAGCTCA CTCTGACCAC TTCTCTGCCT 60 
GCCCAGCATC ATGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 120 
CTGCTCCTGT GCACAAGTTG GTACCAAC AA AGAGCTCTGC TGCCTCGTCT ATACCTCCTG 1 80 
GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240 
AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTGACCCCA ATAAGAAGTG 300 
GGTCCAGAAA TACATCAGCG ACCTGAAGCT GAATGCC TGA GGGGCCTGGA AGCTGCGAGG 360 
GCCCAGTGAA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420 
CCACCCTGGA GGCCACCTCT TCTAAGAGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480 
CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT GTTGAGCTGC 540 
ATTATGAAAT TAGTATTTTC TCTGACATCT CATGACATTG TCTTTATCAT CCTTTCCCCT 600 
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TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660 
CAGACATTGT GCCATATGTA TCAAATGACA AATCTTTATT GAATGGTTTT GCTCAGCACC 720 
ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 7S0 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 1 00 Protein sequence: 
Protein Accession #: NP 002979.1 

1 11 21 • 31 41 SI 

I I I I I I 

MKGLAAALLV LVCTMALCSC AQVGTNKELC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 60 
LLTKRGRQIC ADPNKKWVQK YISDLKLNA 

Seq ID NO: 101 DNA sequence 

Nucleic Acid Accession #: NM_015507.2 

Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 
CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 
GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 
ATGC CTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 
GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 
TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 
CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 
ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 
GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 
AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 
AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1 140 
CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 
ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 
GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 
AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 
GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 
TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 
GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 
TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 
GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGAC T GA ATGTTACT ATCTTTATAT 1920 
TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 
TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 
TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 
CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 
TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 
TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 

Seq ID NO: 102 Protein sequence: 
Protein Accession #: NP_056322.2 

1 11 21 31 41 51 

I I I I I I 

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 
CEATCEPGCK. FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 
GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKKNVT 300 
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360 
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540 
SGLCPDSLLS VDD 

Seq ID NO: 103 DNA sequence 

393 



Nucleic Acid Accession #: NM 001 565. 1 

Coding sequence: 67-363 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 
AGCACC ATG A ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120 
ATTCAAGGAG TACCTCTCTC TAG AACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 1 80 
CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 
CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300 
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 
TAA AACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 
GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 
GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 
GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 
ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720 
TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TQGCTACCTA 840 
CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 
TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 

Seq ID NO: 104 Protein sequence: 
Protein Accession #; NP_001556.1 

1 11 21 31 41 51 

I I I I I I 

MNQTAILICC LIFLTL SGIQ G VPLSRTVRC TCISISNQP V NPRSLEKLEI IPASQFCPRV 60 
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 105 DNA sequence 

Nucleic Acid Accession #: NM_015068.1 

Coding sequence: 1 170-2243 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GTAACAACCG TCACCCTGGG TCCCGACTGC CCACCTCCTC CTCCTCCCCC TCCCCCCAAC 60 
AACAACAACA ACAACAACTC CAAGCACACC GGCCATAAGA GTGCGTGTGT CCCCAACATG 120 
ACCG AACGAA GAAGGGACGA GCTCTCTG AA G AGATCAACA ACTTAAGAGA GAAGGTCATG 1 80 
AAGCAGTCGG AGGAGAACAA CAACCTGCAG AGCCAGGTGC AGAAGCTCAC AGAGGAGAAC 240 
ACCACCCTTC GAGAGCAAGT GGAACCCACC CCTGAGGATG AGGATGATGA CATCG AG CTC 300 
CGCGGTGCTG CAGCAGCTGC TGCCCCACCC CCTCCAATAG AGGAAGAGTG CCCAGAAGAC 360 
CTCCCAGAGA AGTTCGATGG CAACCCAGAC ATGCTGGCTC CTTTCATGGC CCAGTGCCAG 420 
ATCTTCATGG AAAAGAGCAC CAGGGATTTC TCAGTTGATC GTGTCCGTGT CTGCTTCGTG 480 
ACAAGCATGA TGACCGGCCG TGCTGCCCGT TGGGCCTCAG CAAAGCTGGA GCGCTCCCAC 540 
TACCTGATGC ACAACTACCC AGCTTTCATG ATGGAAATGA AGCATGTCTT TGAAGACCCT 600 
CAGAGGCGAG AGGTTGCCAA ACGCAAGATC AGACGCCTGC GCCAAGGCAT GGGGTCTGTC 660 
ATCGACTACT CCAATGCTTT CCAGATGATT GCCCAGGACC TGGATTGGAA CGAGCCTGCG 720 
CTGATTGACC AGTACCACGA GGGCCTCAGC GACCACATTC AGGAGGAGCT CTCCCACCTC 780 
GAGGTCGCCA AGTCGCTGTC TGCTCTGATT GGGCAGTGCA TTCACATTGA GAGAAGGCTG 840 
GCCAGGGCTG CTGCAGCTCG CAAGCCACGC TCGCCACCCC GGGCGCTGGT GTTGCCTCAC 900 
ATTGCAAGCC ACCACCAGGT AGATCCAACC GAGCCGGTGG GAGGTGCCCG CATGCGCCTG 960 
ACGCAGGAAG AAAAAGAAAG ACGCAGAAAG CTGAACCTGT GCCTCTACTG TGGAACAGGA 1020 
GGTCACTACG CTGACAATTG TCCTGCCAAG GCCTCAAAGT CTTCGCCGGC GGGAAACTCC 1080 
CCGGCCCCGC TGTAGAGGGA CCTTCAGCGA CCGGGCCAGA AATAATAAGG TCCCCACAAG 1 140 
ATGATGCCTC ATCTCCACAC TTGCAAGTG A TGC TCCAGAT TCATCTTCCG GGCAGACACA 1200 
CCCTGTTCGT CCGAGCCATG ATCGATTCTG GTGCTTCTGG CAACTTCATT GATCACGAAT 1260 
ATGTTGCTCA AAATGGAATT CCTCTAAGAA TCAAGGACTG GCCAATACTT GTGGAAGCAA 1320 
TTGATGGGCG CCCCATAGCA TCGGGCCCAG TTGTCCACGA AACTCACGAC CTGATAGTTG 1380 
ACCTGGGAGA TCACCGAGAG GTGCTGTCAT TTGATGTGAC TCAGTCTCCA TTCTTCCCTG 1440 
TCGTCCTAGG GGTTCGCTGG CTGAGCACAC ATGATCCCAA TATCACATGG AGCACTCGAT 1500 
CTATCGTCTT TGATTCTGAA TACTGCCGCT ACCACTGCCG GATGTATTCT CCAATACCAC 1560 
CATCGCTCCC ACCACCAGCA CCACAACCGC CACTCTATTA TCCAGTAGAT GGATACAGAG 1620 
TTTACCAACC AGTGAGGTAT TACTATGTCC AGAATGTGTA CACTCCAGTA GATGAGCACG 1680 
TCTACCCAGA TCACCGCCTG GTTGACCCTC ACATAGAAAT GATACCTGGA GCACACAGTA 1740 
TTCCCAGTGG ACATGTGTAT TCACTGTCCG AACCTGAAAT GGCAGCTCTT CGAGATTTTG 1800 
TGGCAAGAAA TGTAAAAGAT GGGCTAATTA CTCCAACGAT TGCACCTAAT GGAGCCCAAG 1860 
TTCTCCAGGT GAAGAGGGGG TGGAAACTGC AAGTTTCTTA TGATTGCCGA GCTCCAAACA 1920 
ATTTTACTAT CCAGAATCAG TATCCTCGCC TATCTATTCC AAATTTAGAA GACCAAGCAC 1980 
ACCTGGCAAC GTACACTGAA TTCGTACCTC AAATACCTGG ATACCAAACA TACCCCACAT 2040 
ATGCCGCGTA CCCGACCTAC CCAGTAGGAT TCGCCTGGTA CCCAGTGGGA CGAGACGGAC 2100 
AAGGAAGATC ACTATATGTA CCTGTGATGA TCACTTGGAA TCCACACTGG TACCGCCAGC 21 60 
CTCCGGTACC ACAGTACCCG CCGCCACAGC CGCCGCCTCC ACCACCACCA CCGCCGCCGC 2220 
CTCCATCTTA CAGTACCCTG TAA ATACCTG TCATGTCCTT CAGGATCTCT GCCCTCAAAA 2280 
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TTTATTCCTG TTCAGCTTCT CAATCAGTGA CTGTGTGCTA AATTTTAGGC TACTGTATCT 2340 
TCAGGCCACC TGAGGCACAT CCTCTCTGAA ACGGCTATGG AAGGTTAGGG CCACTCTGGA 2400 
CTGGCACACA TCCTAAAGCA CCAAAAGACC TTCAACATTT TCTGAGAGCA ACAGAGTATT 2460 
TGCCAATAAA TGATCTCTCA TTTTTCCACC TTGACTGCCA ATCTAACTAA AATAATTAAT 2520 
5 AAGTTTACTT TCCAGCCAGT CCTGGAAGTC TGGGTTTTAC CTGCCAAAAC CTCCATCACC 2580 
ATCTAAATTA TAGGCTGCCA AATTTGCTGT TTAACATTTA CAGAGAAGCT GATACAAACG 2640 
CAGGAAATGC TGATTTCTTT ATGGAGGGGG AGACGAGGAG GAGGAGGACA TGACTTTTCT 2700 
TGCGGTTTCG GTACCCTCTT TTTAAATCAC TGGAGGACTG AGGCCTTATT AAGGAAGCCA 2760 
AAATTATCGG TGCAGTGTGG AAAGGCTTCC GTGATCCTCT CGCTGCACCC TTAGAAACTT 2820 
1 0 CACCGTCTTC AAACTCCATT TCCATGGTTC TGTTAATTCT CAAGGAGCAG CAACTCGACT 2880 

GGTTCTCCCA GGAGCAGGAA AAACCCTTGT GACATGAAAC ATCTCAGGCC TGAAAAGAAA 2940 
GTGCTCTCTC AGATGGACTC TTGCATGTTA AGACTATGTC TTCACATCAT GGTGCAAATC 3000 
ACATGTACCC AATGACTCCG GCTTTGACAC AACACCTTAC CATCATCATG CCATGATGGC 3060 
TTCCACAAAG CATTAAACCT GGTAACCAGA GATTACTGGT GGCTCCAGCG TTGTTAGATG 3120 
1 5 TTCATG AAAT GTG ACCACCT CTCAATCACC TTTGAGGGCT AA AGAGTAGC AC ATCA AAAG 3 1 80 
GACTCCAAAA TCCCATACCC AACTCTTAAG AGATTTGTCC TGGTACTTCA GAAAGAATTT 3240 
TCATGAGTGT TCTTAATTGG CTGGAAAAGC ACCAGCTGAC GTTTTGGAAG AATCTATCCA 3300 
y . TGTGTCTGCC TCCAT ATGCA TCTGGGC ATT TCATCTTCAG TCCCCTCATT AGACTGTAGC 3360 

--. ATTAGGATGT GTGGAGAGAG GAGAAATGAT TTAGCACCCA GATTCACACT CCTATGCCTG 3420 

120 G A AGGGGGAC ATCTTTGAAG AAGAGGAATT AGGGCTGTGG ACACTGTCTT GAGGATGTGG 3480 
f=- ACTTCCTTAG TGAGCTCCAC ATTACTTGAT GGTAACCACT TCAAAAGGAT CAGAATCCAC 3540 

' :r= ' GTAATGAAAA AGGTCCCTCT AGAGGATGGA GCTGATGTGA AGCTGCCAAT GGATGAAAAG 3600 

Lf ! CCTCAGAAAG CAACTCAAAG GACTCAAAGC AACGGACAAC ACAAGAGTTG TCTTCAGCCC 3660 

AGTGACACCT CTGATGTCCC CTGGAAGCTT TGTGCTAACC TGGGACTGCC TGACTTCCTT 3720 
*Z ; 5 TAGCCTGGTC CCTTGCTACT ACCTTGAACT GTTTTATCTA ACCTCTCTTT TTCTGTTTAA 3780 
fl ! TTCTTTGCTA CTGCCATTGA CCCTGCTGCA GGATTTGTGT CATTTTCCTG CCTGGTTGCT 3840 

I "] GAGACTCCAT TTTGCTGCCA CACACAGAGA TGTAAGAGGC AGGCTTTAAT TGCCAAAGCA 3900 

CAGTTTGAGC AGTAGAAAAC AACATGGTGT ATATCTCAAA TTGCCTGACA TGAAGAGGAG 3960 
f= : ■ TCTAACGGTG AAGTTTCACT TTTCATCAGC ATCATCTTTC ACATGTTCAT TATCATCCGC 4020 

^30 TCTTATTCTT GCATGTTTAA ACACTTAAAA TTTTTAGTAT AATTTTTAGT GTGTTTTGAA 4080 
3 GTGGTGACTA GGCTTTCAAA AACTTCCATT GAATTACAAA GCACTATCCA GTTCTTATTG 4140 

rr=: TTAAACTAAG TAAAAATGAT AAGTAACATA GTGTAAAATA TTCCTTTACT GTGAACTTCT 4200 

*~* ; TACAATGCTG TGAATGAGAG GCTCCTCAGA ACTGGAGCAT TTGTATAATA ATTCATCCTG 4260 

M ' TTCATCTTCA ATTTT A ACAT CATATATA AT TTCAATTCTA TCAATTGGGC CTTTAAAAAT 4320 

=3=5 CATATAAAAG GATATAAAAT TTGAAAAGAG AAACCTAATT GGCTATTTAA TCCAAAACAA 4380 
| y CTTTTTTTTT TCCTTCAATG GAATCAGAAA GCTTGTCAAT CACTCATGTG TTTTAGAGTA 4440 

J: ! ATTACTTTTA AAATGGTGCA TTTGTGCTTC TG AACTATTT TGAAGAGTCA CTTCTGTTTA 4500 

X i CCTCAAGTAT CAATTCATCC TCCATACATT TGAATTCAAG TTGTTTTTTG TCAAATTTAC 4560 

fc= 1 AGTTGTCAAT TGATCTTCAA GCTGCAGGGT GCCTAGAAAT GGGCCGTTGT CTGTAGCCCT 4620 

rtfcO GGCATGTGCA CACGGACATT TGCCACCACT GCAAGCAAAA GTCTG GAG AA GTTCACCAAC 4680 
GACAAGAACG ATTAGGGAAA ATATGCTGCT GTGGGTTAAC AACTCAGAAA GTCCCTGATC 4740 
CACATTTGGC TGTTTACTAA AGCTTGTGAT TAACTTTTTG GCAGTGTGTA CTATGCTCTA 4800 
TTGCTATATA TGCTATCTAT AAATGTAGAT GTTAAGGATA AGTAATTCTA AATTTATTAT 4860 
TCTATAGTTT TGAAGTTTGG TTAAGTTTCC TTTCACTCAA TTGATTTATT TTGTTGTTAA 4920 
45 TCAAATTTAT GTTAATTGGA TCCTTTAAAT TTTTTTTGGC ATTTTCCAAC AAAAATGGCT 4980 

TTATTCATAA GAAAGGAAAA AAATCAATGG AATTTGATAT CTAAAGAAGT TAGAAA GGGA 5040 
GCAAAATAAA AAACATAAAG GAGATAGATG AATTAGTAAG CAAATCAGTA GTCGAGTTTT 5100 
TCAAACTGGC AAAATTAATT AATTGACTTT TAGCCCAAAT TTACATTGTT AATTAAATCA 5160 
AGAAGGAAGA AGATCTAAGA GCTCCCATTG ATAGGCAAGC CTAGAGAGAA CTAGCTAAAT 5220 
50 TTATCATGCT AGG ATATTG A AACACAGAAA GTTTACATAC ATTTATGAAG GGTCAATTTA 5280 
GTTTGGACAG TGAGGTATTT GTCTTAGTGG AAAAAAGGAG AATTAGTCTG ATCAAATCGT 5340 
GAAGTAATAC AGTGAACTTG CAGGTGCACA AAATAAGAGG GCCACATCTA TATGGTGCAG 5400 
TCTGGAATTC TGTTTAAGTT TGTAGGTACC TCTTGGACTT CTGAATTGAT CCAGTTGTCA 5460 
TCCACCACAG ACATCTCACA TCAGATACAG ACAGTTCCAA GATTGACAAC AGAGAACAAC 5520 
5 5 CTGCTGGAAA GACCTGGGCA GAAATGGAGA GCCCTGCGGG AACCATGCTA CATTTTCATC 5580 
TAAAGAGAGA ATGCACATCT GATGAGACTG AAAGTTCTTT GTTGTTTTAG ATTGTAGAAT 5640 
GGTATTGAAT TGGTCTGTGG AAAATTGCAT TGCTTTTATT TCTTTGTGTA ATCAAGTTTA 5700 
AGTAATAGGG GATATATAAT CATAAGCATT TTAGGGTGGG AGGGACTATT AAGTAATTTT 5760 
AAGTGGGTGG GGTT ATTTAG AATGTTAGAA TAATATTATG TATTAGATAT CGCTATAAGT 5820 
60 GGACATGCGT ACTTACTTGT AACCCTTTAC CCTATAATTG CTATCCTTAA AGATTTCAAA 5880 

TAAACTCGGA GGGAACTGCA GGGAGACCAA CTTATTTAGA GCGAATTGGA CATGGATAAA 5940 
AACCCCAGTG GGAGAAAGTT CAAAGGTGAT TAGATTAATA ATTTAATAGA GGATGAGTGA 6000 
CCTCTGATAA ATTACTGCTA GAATGAACTT GTCAATGATG GATGGTAAAT TTTCATGGAA 6060 
GTTATAAAAG TGATAAATAA AAACCCTTGC TTTTACCCCT GTCAGTAGCC CTCCTCCTAC 6120 
65 CACTGAACCC CATTGCCCCT ACCCCTCCTT CTAACTTTAT TGCTGTATTC TCTTCACTCT 61 80 
ATATTTCTCT CTATTTGCTA ATATTGCATT GCTGTTACAA TAAAAATTCA ATAAAGATTT 6240 
AGTGGTTAAG TGC 

Sea ID NO: 106 Protein sequence: 
70 Protein Accession U: NP_055883.1 

1 11 21 31 41 51 

I I I I I I 

MTERRRDELS EEINNLREKV MKQSEENNNL QSQVQKLTEE NTTLREQVEP TPEDEDDDIE 60 
75 LRGAAAAAAP PPPIEEECPE DLPEKFDGNP DMLAPFMAQC QIFMEKSTRD FSVDRVRVCF 120 

VTSMMTGRAA RWASAKLERS HYLMHNYPAF MMEMKHVFED PQRREVAKRK IRRLRQGMGS 1 80 
VIDYSNAFQM IAQDLDWNEP ALIDQYHEGL SDHIQEELSH LEVAKSLSAL IGQCIHIERR 240 
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LARAAAARKP RSPPRALVLP HIASHHQVDP TEPVGGARMR LTQEEKERRR KLNLCLYCGT 300 
GGHYADNCPA KASKSSPAGN SPAPL 

Seq ID NO: 107 DNA sequence 

Nucleic Acid Accession #: NM 003679. 1 

Coding sequence: 47-1507(underIined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGCA GAAGCAACAA TAATTGTGAA AAATACTTCA GCAGTTATGG ACTCATCTGT 60 
CATTCAAAGG AAAAAAGTAG CTGTCATTGG TGGTGGCTTG GTTGGCTCAT TACAAGCATG 120 
CTTTCTTGCA AAGAGGAATT TCCAGATTGA TGTATATG AA GCTAGGGAAG ATACTCGAGT 1 80 
GGCTACCTTC ACACGTGGAA GAAGCATTAA CTTAGCCCTT TCTCATAGAG GACGACAAGC 240 
CTTGAAAGCT GTTGGCCTGG AAGATCAGAT TGTATCCCAA GGTATTCCCA TGAGAGCAAG 300 
AATGATCCAC TCTCTTTCAG GAAAAAAGTC TGCAATTCCC TATGGGACAA AGTCTCAGTA 360 
TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGGATCTA TTGACTGCTG CTGAGAAATA 420 
CCCCAATGTG AAAATGCACT TTAACCACAG GCTGTTGAAA TGTAATCCAG AGGAAGGAAT 480 
GATCACAGTG CTTGGATCTG ACAAAGTTCC CAAAGATGTC ACTTGTGACC TCATTGTAGG 540 
ATGTGATGGA GCCTATTCAA CTGTCAGATC TCACCTGATG AAGAAACCTC GCTTTGATTA 600 
CAGTCAGCAG TACATTCCTC ATGGGTACAT GGAGTTGACT ATTCCACCTA AGAACGGAGA 660 
TTATGCCATG GAACCTAATT ATCTGCATAT TTGGCCTAGA AATACCTTTA TGATGATTGC 720 
ACTTCCTAAC ATGAACAAAT CATTCACATG TACTTTGTTC ATGCCCTTTG AAGAGTTTGA 780 
AAAACTTCTA ACCAGTAATG ATGTGGTAGA TTTCTTCCAG AAATACTTTC CGGATGCCAT 840 
CCCTCTAATT GGAGAGAAAC TCCTAGTGCA AGATTTCTTC CTGTTGCCTG CCCAGCCCAT 900 
GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC TGGGAGATGC 960 
AGCTCATGCT ATAGTGCCGT TTTTTGGGCA AGGAATGAAT GCGGGCTTTG AAGACTGCTT 1020 
GGTATTTGAT GAGTTAATGG ATAAATTCAG TAACGACCTT AGTTTGTGTC TTCCTGTGTT 1080 
CTCAAGATTG AGAATCCCAG ATGATCACGC GATTTCAGAC CTATCCATGT ACAATTACAT 1140 
AG AG ATGCG A GCACATGTCA ACTCAAGCTG GTTCATTTTT CAGAAGAACA TGGAGAGATT 1200 
TCTTCATGCG ATTATGCCAT CGACCTTTAT CCCTCTCTAT ACAATGGTCA CTTTTTGCAG 1260 
AATAAGATAC CATGAGGCTG TGCAGCGTTG GCATTGGCAA AAAAAGGTGA TAAACAAAGG 1320 
ACTCTTTTTC TTGGGATCAC TGATAGCCAT CAGCAGTACC TACCTACTTA TACACTACAT 1380 
GTCACCACGA TCTTTCCTCT GCTTGAGAAG ACCATGGAAC TGGATAGCTC ACTTCCGGAA 1440 
TACAACATGT TTCCCCGCAA AGGCCGTGGA CTCCCTAGAA CAAATTTCCA ATCTCATTAG 1500 
CAGG TGAT AG AAAGGTTTTG TGGTAGCAAA TGCATGATTT CTCTGTGACC AAAATTAAGC 1560 
ATGAAAAAAA TGTTTCCATT GCCATATTTG ATTCACTAGT GGAAGATAGT GTTCTGCTTA 1620 
TAATTAAACT GAATGTAGAG TATCTCTGTA TGTTAATTGC AATTACTGGT TGGGGGGTGC 1680 
ATTTTAAAAG ATGAAACATG CAGCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT 1740 
AACTATAAAA GTGCAATGAC TAAGATCCTT CACTTCTCTG AAAGTAAGGC CCTAGATGCC 1800 
TCAGGGAAGA CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGGACTC GCAACAGCAT 1 860 
TGACTCAACA CCTAGGACTA AAAATCACAA CTTAACTAGC ATGTTAACTG CACTTTTCAT 1920 
TACGTGAATG GAACTTACCT AACCACAGGG CTCAGACTTA CTAGATAAAA CCAGAAATGG 1980 
AAATAAGGAA TTCAGGGGAG TTCCAGAGAC TTACAAAATG AACTCATTTT ATTTTCCCAC 2040 
CTTCAAA TAT AAGTATTATC ATCTATCTGT TTATCGTCTA TCTATCTATC ATCTATCTAT 2100 
CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTATCTATC 2160 
TCTATTTATT TATGTATTTA GAGATCAGGT CTCACTCTGT TGACCAGGCT GGAGTGCAGT 2220 
GGTGAGATCT GGGTTCACTG CAACCTCTGC CTCCTGGGCT CAAGCAATCC TCCCACTTCA 2280 
GCCTCCCAAA TAGCTGGGGC TACCATGGTA TTTTTCAGTA GAGACCGGGT CTTGCCATGC 2340 
TGCCCAGGCC AGTCTCAAAC TCCTGGCCTC ATGTGATCTG CCCACCTCAG CCTCCCAAAG 2400 
TACAGGGATT AGAGTTGTGA GCCACCGCTG CCAGCCCAGA GTTACCCTCT AAAGATAAGA 2460 
AAAAGGCTAT TAATATCATA CTAAGTGAAG GACAGGAAAG GGTTTTATTC ATAAATTAAA 2520 
TGTCTACATG TGCCAGAATG GAAAGGAAAC AAGGGGAGAC AACTTTTATA GAAATACAAA 2580 
GCCATTACTT TATTCAATTT CAGACCCTCA GAAGCAATTT ACTAATTTAT TCTTCGACTA 2640 
CATACTGCAG CAGAACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TGAAATGTTT 2700 
TCTTTGGTTC ATCCTTCTTT AACAGGCTGC TGAGTCACTC AGAAATCCTT CAAACATGAT 2760 
TAATTATGAA GATGAAACAC TAGAGTCATA TAAGAAATAA AAATTGGGCA ATAAAATAAA 2820 
ATGATTCAGT GTTTCTTTTC TATATTGTCA ATGAAAACCT TGAGTTCTAA TAATCCATGT 2880 
TCAGTTTGTA GGGAAAGAAA AAATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG 2940 
GGGCCCCTAT AACAAAAGAC AGATTGACAA GAGAAAAACA AACATAAATT TATTAGCGGG 3000 
TATATGTAAT ATATATGTGG GAAATACAGG GGAATGAGCA AATCTCAAAG AGCTGGCGTC 3060 
TTAGAACTCC CTGGCTTATA TAGCATCGAC AAAGAACAGT AAATTTTTAG AGAAACAACA 3120 
AAACAAAGAA AAAGAGCTTT GAGTCTGTAG GGGCAGCAAT TTGGGGGAAG CAAATATATG 3180 
GGAGTTTGCC TTGTAGATTC CTCTGGTGGT GGTCTCCAGG CTGACAAGGA TTCAAAGTTG 3240 
TCTCTGAAAC TCCTCTTTGT CATACTGCAC ATATAAAACG TCTTTTGTTT CCAACAAGAG 3300 
GATTTCTTTT TCATTCTAGA ATTATCTCCT TGATAACTTG ATCAGATATA GGACATGACA 3360 
CTGAATAGAG TCCAACAGTA CAAAAAAAAT TCAGTATGTT CTAGCTACTT CACACATGTG 3420 
TACGCGACAG TTATTTTTAC AGTAAGGTAT TTTCGAGAAA AATGCATTAC GTGTTTTGGA 3480 
AAATAGAGTA ATTTAAAAAA TATATTTGAA ATGAAAATCT CCAACACATT AGAAGATGAT 3540 
GATGTTAGAT GCCCATCGTG TGCCACAAGT GGTTTTTTCA TTATGTAAAG CACCCGTTGA 3600 
ATTAAAAGAA TTTGTTTTTG TTCAACCTCT TCCTGAGGCC CAAGAGCATA TGGGCAATTC 3660 
GGATTTCCTG CTGGACCACA AGGTTCTGTT GATATTACAT AGAAACGGGT ATTCCAGACA 3720 
CTTCTTATGA TGAAAGTCCA AAAGTGGCAT CCAATTTAAG GCCCCATCTT TCGTTGCCAT 3780 
TCTTCATTCC TACAAAGGAC GAACTTGGAT TACATCAACT TTGGACCCAT TGGTTTTGTC 3840 
GCTGTCGTCA ACTGACAGTG ATTCACCACT GGTGATGATA AAAATGATGG AAGAAGAGTT 3900 
GAAAGTCACT TTTTTCTTTG GCCTGTCCCC ATCTTTCTGT GACATCACAA TGGGTCTGAT 3960 
CTGCATTTCA CTTCCAGCTG CTGGTAGGTC TTTAGCAGGC CTCTGGCACC TCAGCAGTCG 4020 
GAGGCACAGA AGCTGCAAAA GGGATCTTCG AAACTGGGCA GAGAAAAAAT AAAGTGGAAT 4080 
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ATTAAGTAAA AGTTGGGCAC TAATCTGGAT TAACATTCGA GGAAATCAGT TGAGCTGATT 4140 
TAAGTTGTTT TTTGTTTGTT AGCAGGTGTG GATGTGGGGT TATGTGGTCA TGCTCAGATC 4200 
TACCTAAATC ACCCCAGAGC TTTATGTCTT TT ATTCATTC TAAATCTTAT TAACCGGAAT 4260 
ATGTAGGACC ATTTCAATAC CTTGTAATCC TCCAAGCTTC AATCTGCACA CACTTTCTAT 4320 
5 GAGGGCAGGT ACAACTATTA AGAGATTTTG AACATTAAGT TAGTCCACAA ATATTCAGTG 4380 
GGCATCTACT AGGTGACAGC CACTGTGCTA TAATTAGAGA CTTTTTACTA TAAGCATCAA 4440 
AAACAGATAA GGCTCTTCCT GGCAGAGTTT ACAGCCTGGT GTACTTGCTA ATGTCTCTTT 4500 
AATTAGGTGA AGAATTTTTT TTTTCTATCG AAATTACTAA TCAGTTGGGG AAAAAAATAC 4560 
TATAGCAGAC AGCACTAATG TCATCAACAA ACATTGTTCT TCTCCGTGTC CTGGGTACAA 4620 

1 0 CATCGAATAA TATTTCTTGG CCTCCTTTCC GCTTCTCCTC TCTGCTGTTC CTCTCTACAA 4680 

GAACCTGGGA GGCCAACGCC TAAAGATCAT AATATCACAA TGGAAGGAAC CTAGATTCCT 4740 
AAATGACTGC ATAGGACAGA TCCCATCTCC TCCACCCAAT ACATTATTAG ACTGAACTGT 4800 
GACCTGAAAT GAGCAATAAA CTCTGTATTA ATTCACTGAA ATGTTGGGGT TGCTTGTTAT 4860 
AGTAGTCGGT CCATCATGAC CAGTAAAACA TAAATCAAAA GTTAATGTAA TTGTTATCCC 4920 

1 5 ATTATTTAGA GCGAAATAAA TGTTGAATAT ATGGACTTTC TCAGATTAGG AAATACCAAT 4980 
TAAAAATATA ATAAATAGCT 
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Seq ID NO: 108 Protein sequence: 
Protein Accession #: NP_003670.1 



1 11 21 31 41 SI 

I I I I I I 

ill MDSSVIQRKK VAVIGGGLVG SLQACFLAKR NFQIDVYEAR EDTRVATFTR GRSINLALSH 60 

W RGRQALKAVG LEDQIVSQGI PMRARMIHSL SGKKSADPYG TKSQYILSVS RENLNKDLLT 1 20 

H?25 AAEKYPNVKM HFNHRLLKCN PEEGMITVLG SDKVPKDVTC DLIVGCDGAY STVRSHLMKK 180 

PRFDYSQQYIPHGYMELTJPPK^GDYAMEPNYLHIWPRNTFMMIALPNMNKSFTCTLFMP 240 

i 5 FEEFEKLLTS NDVVDFFQKY FPDAIPLIGE KLLVQDFFLL PAQPMISVKC SSFHFKSHCV 300 

" ; LLGDAAHAIV PFFGQGMNAG FEDCLVFDEL MDKFSNDLSL CLPVFSRLRI PDDHAISDLS 360 

5 MYNYIEMRAH VNSSWFIFQK NMERFLHAM PSTFIPLYTM VTFSRIRYHE AVQRWHWQKK 420 

' 30 VINKGLFFLG SLIAISSTYL LIHYMSPRSF LCLRRPWNWI AHFRNTTCFP AKAVDSLEQI 480 

!_ SNLISR 

i ■ Seq ID NO: 109 DNA sequence 

F =; Nucleic Acid Accession #: NM_006115.1 

fli3 5 Coding sequence: 236-1765 (underlined sequences correspond to start and stop codons) 

5=P^ 1 11 21 31 41 51 

I I I I I I 
fee' GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 60 

HMO CGGGACACCC CACCCGCTTC CCAGGCGTGA CCTGTCAAC A GCAACTTCGC GGTGTGGTGA 120 
ACTCTCTGAG GAAAAACCAT TTTGATTATT ACTCTCAG AC GTGCGTGGC A ACAAGTGACT 1 80 
GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAATGGA 240 
ACGAAGGCGT TTGTGGGGTT CCATTCAGAG CCGATACATC AGCATGAGTG TGTGGACAAG 300 
CCCACGGAGA CTTGTGGAGC TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CCCTGGCCAT 360 
45 TGCCGCCCTG GAGTTGCTGC CCAGGGAGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 420 
CGGGAGACAC AGCCAGACCC TGAAGGCAAT GGTGCAGGCC TGGCCCTTCA CCTGCCTCCC 480 
TCTGGGAGTG CTGATGAAGG GACAACATCT TCACCTGGAG ACCTTCAAAG CTGTGCTTGA 540 
TGGACTTGAT GTGCTCCTTG CCCAGGAGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 
GGATTTACGG AAGAACTCTC ATCAGGACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 
5 0 TCTGTACTCA TTTCCAGAGC CAGAAGCAGC TCAGCCCATG ACAAAGAAGC G AAAAGTAG A 720 
TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAGACCTGTT 780 
CCTCAAGGAA GGTGCCTGTG ATGAATTGTT CTCCTACCTC ATTGAGAAAG TGAAGCGAAA 840 
GAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900 
TATCAAGATG ATCCTGAAAA TGGTGCAGCT GGACTCTATT GAAGATTTGG AAGTGACTTG 960 
5 5 TACCTGGAAG CTACCCACCT TGGCGAAATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1020 
GCGTAG ACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080 
GCAGTATATC GCCCAGTTCA CCTCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1140 
TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200 
CCCCTTGGAA ACCCTCTCAA TAACTAACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260 
60 GTCCCAGAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 
CGATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGGAGAGA GCCTCTGCCA CCCTCCAGGA 1380 
CCTGGTCTTT GATGAGTGTG GGATCACGGA TGATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440 
GAGCCACTGC TCCCAGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCT CCATATCTGC 1500 
CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560 
65 TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1620 
TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 
TAGTGCCAAC CCCTGTCCTC ACTG TGGGG A CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 
GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800 
TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860 
70 ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 
GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT 1980 
GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 
GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AA AAA AAA 



Seq ID NO: 110 Protein sequence: 
Protein Accession #: NP 006106. 1 
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1 11 21 31 41 51 

I 1 I 1 I I 

MERRRLWGSI QSRYISMSVW TSPRRLVELA GQSLLKDEAL AIAALELLPR ELFPPLFMAA 50 
5 FDGRHSQTLK AMVQAWPFTC LPLGVLMKGQ HLHLETFKAV LDGLDVLLAQ EVRPRRWKLQ 120 
VLDLRKNSHQ DFWTVWSGNR ASLYSFPEPE AAQPMTKKRK VDGLSTEAEQ PFIPVEVLVD 180 
LFLKEGACDE LFSYLIEKVK RKKNVLRLCC KKLKIFAMPM QDDCMILKMV QLDSIEDLEV 240 
TCTWKLPTLA KFSPYLGQMI NLRRLLLSHI HASSYISPEK EEQYIAQFTS QFLSLQCLQA 300 
LYVDSLFFLR GRLDQLLRHV MNPLETLSIT NCRLSEGDVM HLSQSPSVSQ LSVLSLSGVM 360 
1 0 LTDVSPEPLQ ALLERASATL QDLVFDECGI TDDQLLALLP SLSHCSQLTT LSFYGNSISI 420 

SALQSLLQHL IGLSNLTHVL YPVPLESYED IHGTLHLERL AYLHARLREL LCELGRPSMV 480 
WLSANPCPHC GDRTFYDPEP ILCPCFMPN 



1 5 Seq ID NO: 1 1 1 DNA sequence 

Nucleic Acid Accession #: NM_003 8 1 5 

Coding sequence: 8-2452 (underlined sequences correspond to start and stop codons) 

H 

js=, 1 11 21 31 41 51 

"r20 | | | | | | 

D CGCTGCCATG CGGCTGGCGC TGCTCTGGGC CCTGGGGCTC CTGGGCGCGG GCAGCCCTCT 60 

= =: GCCTTCCTGG CCGCTCCCAA ATATAGGTGG CACTGAGGAG CAGCAGGCAG AGTCAGAGAA 120 

« i GGCCCCGAGG GAGCCCTTGG AGCCCCAGGT CCTTCAGGAC GATCTCCCAA TTAGCCTCAA 1 80 

ft i AAAGGTGCTT CAGACCAGTC TGCCTGAGCC CCTGAGGATC AAGTTGGAGC TGGACGGTGA 240 

i"25 CAGTCATATC CTGGAGCTGC TACAGAATAG GGAGTTGGTC CCAGGCCGCC CAACCCTGGT 300 
I V GTGGTACCAG CCCGATGGCA CTCGGGTGGT CAGTGAGGGA CACACTTTGG AGAACTGCTG 360 

%. ! CTACCAGGGA AGAGTGCGGG GATATGCAGG CTCCTGGGTG TCCATCTGCA CCTGCTCTGG 420 

J: ! GCTCAGAGGC TTGGTGGTCC TGACCCCAGA GAGAAGCTAT ACCCTGGAGC AGGGGCCTGG 480 

U GGACCTTCAG GGTCCTCCCA TTATTTCGCG AATCCAAGAT CTCCACCTGC CAGGCCACAC 540 

3 0 CTGTGCCCTG AGCTGGCGGG AATCTGTACA CACTCAGACG CCACCAGAGC ACCCCCTGGG 600 
* ACAGCGCCAC ATTCGCCGGA GGCGGGATGT GGTAACAGAG ACCAAGACTG TGGAGTTGGT 660 

p GATTGTGGCT GATCACTCGG AGGCCCAGAA ATACCGGGAC TTCCAGCACC TGCTAAACCG 720 

£> CACACTGGAA GTGGCCCTCT TGCTGGACAC ATTCTTCCGG CCCCTGAATG TACGAGTGGC 780 

~ : ACTAGTGGGC CTGGAGGCCT GGACCCAGCG TGACCTGGTG GAGATCAGCC CAAACCCAGC 840 

H ! 35 TGTCACCCTC GAAAACTTCC TCCACTGGCG CAGGGCACAT TTGCTGCCTC GATTGCCCCA 900 
5 ~ TGACAGTGCC CAGCTGGTGA CTGGTACTTC ATTCTCTGGG CCTACGGTGG GCATGGCCAT 960 

* ! TCAGAACTCC ATCTGTTCTC CTGACTTCTC AGGAGGTGTG AACATGGACC ACTCCACCAG 1020 

p CATCCTGGGA GTCGCCTCCT CCATAGCCCA TGAGTTGGGC CACAGCCTGG GCCTGGACCA 1080 

Z 1 '. TGATTTGCCT GGGAATAGCT GCCCCTGTCC AGGTCCAGCC CC AG CCAAG A CCTGCATCAT 1 140 

ft: 40 GGAGGCCTCC ACAGACTTCC TACCAGGCCT GAACTTCAGC AACTGCAGCC GACGGGCCCT 1200 
GGAGAAAGCC CTCCTGGATG GAATGGGCAG CTGCCTCTTC GAACGGCTGC CTAGCCTACC 1260 
CCCTATGGCT GCTTTCTGCG GAAATATGTT TGTGGAGCCG GGCGAGCAGT GTGACTGTGG 1320 
CTTCCTGGAT GACTGCGTCG ATCCCTGCTG TGATTCTTTG ACCTGCCAGC TGAGGCCAGG 1380 
TGCACAGTGT GCATCTGACG GACCCTGTTG TCAAAATTGC CAGCTGCGCC CGTCTGGCTG 1440 
45 GCAGTGTCGT CCTACCAGAG GGGATTGTGA CTTGCCTGAA TTCTGCCCAG GAGACAGCTC 1500 
CCAGTGTCCC CCTGATGTCA GCCTAGGGGA TGGCGAGCCC TGCGCTGGCG GGCAAGCTGT 1560 
GTGCATGCAC GGGCGTTGTG CCTCCTATGC CCAGCAGTGC CAGTCACTTT GGGGACCTGG 1620 
AGCCCAGCCC GCTGCGCCAC TTTGCCTCCA GACAGCTAAT ACTCGGGGAA ATGCTTTTGG 1680 
GAGCTGTGGG CGCAACCCCA GTGGCAGTTA TGTGTCCTGC ACCCCTAGAG ATGCCATTTG 1740 
50 TGGGCAGCTC CAGTGCCAGA CAGGTAGGAC CCAGCCTCTG CTGGGCTCCA TCCGGGATCT 1800 
ACTCTGGGAG ACAATAGATG TGAATGGGAC TGAGCTGAAC TGCAGCTGGG TGCACCTGGA 1860 
CCTGGGCAGT GATGTGGCCC AGCCCCTCCT GACTCTGCCT GGCACAGCCT GTGGCCCTGG 1920 
CCTGGTGTGT ATAGACCATC GATGCCAGCG TGTGGATCTC CTGGGGGCAC AGGAATGTCG 1980 
AAGCAAATGC CATGGACATG GGGTCTGTGA CAGCAACAGG CACTGCTACT GTGAGGAGGG 2040 
55 CTGGGCACCC CCTGACTGCA CCACTCAGCT CAAAGCAACC AGCTCCCTGA CCACAGGGCT 2100 
GCTCCTCAGC CTCCTGGTCT TATTGGTCCT GGTGATGCTT GGTGCCGGCT ACTGGTACCG 2160 
TGCCCGCCTG CACCAGCGAC TCTGCCAGCT CAAGGGACCC ACCTGCCAGT ACAGGGCAGC 2220 
CCAATCTGGT CCCTCTGAAC GGCCAGGACC TCCGCAGAGG GCCCTGCTGG CACGAGGCAC 2280 
TAAGTCTCAG GGGCCAGCCA AGCCCCCACC CCCAAGGAAG CCACTGCCTG CCGACCCCCA 2340 
60 GGGCCGGTGC CCATCGGGTG ACCTGCCCGG CCCAGGGGCT GGAATCCCGC CCCTAGTGGT 2400 
ACCCTCCAGA CCAGCGCCAC CGCCTCCGAC AGTGTCCTCG CTCTACCTCTGACCTCTCCG 2460 
GAGGTTCCGC TGCCTCCAAG CCGGACTTAG GGCTTCAAGA GGCGGGCGTG CCCTCTGGAG 2520 
TCCCCTACCA TGACTGAAGG CGCCAGAGAC TGGCGGTGTC TTAAGACTCC GGGCACCGCC 2580 
ACGCGCTGTC AAGCAACACT CTGCGGACCT GCCGGCGTAG TTGCAGCGGG GGCTTGGGGA 2640 
65 GGGGCTGGGG GTTGGACGGG ATTGAGGAAG GTCCGCACAG CCTGTCTCTG CTCAGTTGCA 2700 
ATAAACGTGA CATCTTGGGA GCGTTAAAAA AAAAAAAAAA 

Seq ID NO: 1 12 Protein sequence: 
Protein Accession #: NP_003806.2 

1 11 21 31 41 51 

MRLALLWALG LLGAGSPLPS WPLPNIGGTE EQQAESEKAP REPLEPQVLQ DDLPISLKKV 60 
LQTSLPEPLR IKLELDGDSH ILELLQNREL VPGRPTLVWY QPDGTRVVSE GHTLENCCYQ 120 
GRVRGYAGSW VSICTCSGLR GLVVLTPERS YTLEQGPGDL QGPPIISRIQ DLHLPGHTCA 180 
LSWRESVHTQ TPPEHPLGQR HIRRRRDWT ETKTVELVIV ADHSEAQKYR DFQHLLNRTL 240 
EVALLLDTFF RPLNVRVALV GLEAWTQRDL VEISPNPAVT LENFLHWRRA HLLPRLPHDS 300 
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10 



60 



AQLVTGTSFS GPTVGMAIQN SICSPDFSGG VNMDHSTSIL GVASSIAHEL GHSLGLDHDL 360 
PGNSCPCPGP APAKTCIMEA STDFLPGLNF SNCSRRALEK ALLDGMGSCL FERLPSLPPM 420 
AAFCGNMFVE PGEQCDCGFL DDCVDPCCDS LTCQLRPGAQ CASDGPCCQN CQLRPSGWQC 480 
RPTRGDCDLP EFCPGDSSQC PPDVSLGDGE PCAGGQAVCM HGRCASYAQQ CQSLWGPGAQ 540 
PAAPLCLQTA NTRGNAFGSC GRNPSGSYVS CTPRDAICGQ LQCQTGRTQP LLGSIRDLLW 600 
ET1DVNGTEL NCSWVHLDLG SDVAQPLLTL PGTACGPGLV CIDHRCQRVD LLGAQECRSK 660 
CHGHGVCDSN RHCYCEEGWA PPDCTTQLKA TSSLTTGLLL SLLVLLVLVM LGAGYWYRAR 720 
LHQRLCQLKG PTCQYRAAQS GPSERPGPPQ RALLARGTKS QGPAKPPPPR KPLPADPQGR 780 
CPSGDLPGPG AGIPPLVVPS RPAPPPPTVS SLYL 



Seq ID NO: 113 DNA sequence 

Nucleic Acid Accession #: NM_00241 6 

Coding sequence: 40-417 (underlined sequences correspond to start and stop codons) 

15 1 11 21 31 41 51 

I I I I I I 

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTATGAAGAAAAG TGGTGTTCTT 60 
TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTAGTGAGA 120 
- OH AAGGGTCGCT GTTCCTGCAT CAGCACCAAC CAAGGGACTA TCCACCTACA ATCCTTGAAA 180 
r=ZU GACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCGAGAAAA TTGAAATCAT TGCTACACTG 240 
Cj AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTGAAGGA ACTGATTAAA 300 

. AAGTGGGAGA AACAGGTCAG CCAAAAGAAA AAGCAAA AGA ATGGGAAAAA ACATCAAAAA 360 

It= I AAGAAAGTTC TGAAAGTTCG AAAATCTCAA CGTTCTCGTC AAAAGAAG AC TACATAAG AG 420 

IT L ACCACTTCAC CAATAAGTAT TCTGTGTTA A AAATGTTCTA TTTTAATTAT ACCGCTATC A 480 

ZMJ TTCCAAAGGA GGATGGCATA TAATACAAAG GCTTATTAAT TTGACTAGAA AATTTAAAAC 540 
W ATTACTCTGA AATTGTAACT AAAGTTAGAA AGTTGATTTT AAGAATCCAA ACGTTAAGAA 600 

p 1 TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660 

I ' ! TTAAGGCCAT G ATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 

~~h A CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAG AGAG 780 
j==pU TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840 
TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900 
9 CTACAGGCCT CACACACAAT GTGTCTGAGA GATTCATGCT GATTGTTATT GGGTATCACC 960 

?=: ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020 

- TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TGCTTCATTC 1080 
y J J AAGTCAGCTC TTCTCCATCC TACCACAATG CAGTGCCTTT CTTCTCTCCA GTGCACCTGT 1 1 40 
~ . CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 

= W AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1 260 

JC AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 

X . _ CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 
L=H-U AGATTGTCAG CTCCTTGAGG GCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440 
fg j CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500 

• TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 

CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 1620 
GATGCAACAT CCTTGTCTTT TTATG ACAGG ATGTTTGCTC AGCTTCTCCA ACAATAAG AA 1 680 
43 GCACGTGGTA A AACACTTGC GGATATTCTG G ACTGTTTTT AAAAAATATA CAGTTTACCG 1 740 
AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800 
CCAACCATAC AAAAATTCCT TTTCCCGAAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT I860 
TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920 
AGTTTTATTG TCCGTTTACT TGTTTCAG AG TTTGrATTGT GATTATC AAT TACCACACCA 1980 
J U TCTCCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGTCGGT 2040 
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 
GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160 
CTTTCCCAAA TTGAATCACT GCTCACACTG CTGATGATTT AGAGTGCTGT CCGGTGGAGA 2220 
TCCCACCCGA ACGTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280 
3 J AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC ACTTACCTTG CATCTCACAG 2340 
GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATAA 2400 
TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460 
AAACAGTATT GACTTGTATA CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 



Seq ID NO: 1 14 Protein sequence: 
Protein Accession #: NP 002407 



,_ 1 11 21 31 41 51 

o5 I l l | | | 

MKKSGVLFLL GIILLVLIGV QGTPVVRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 60 
IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR 1 20 
QKKTT 

70 Seq ID NO: 115 DNA sequence 

Nucleic Acid Accession ft: NM_O03238.1 

Coding sequence: 1 82-1426 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

75 I | i | | | 

CAAGCAGGAT ACGTTTTTCT GTTGGGCATT GACTAGATTG TTTGCAAAAG TTTCGCATCA 60 
AAAACAAACA ACAACAACAA AAAACCAAAC AACTCTCCTT GATCTATACT TTGAGAATTG 120 
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TTGATTTCTT TTTTTTTATT CTGACTTTTA AAAACAACTT TTTTTTCCAC TTTTTTAAAA 1 80 
A ATGC ACTAC TGTGTGCTGA GCGCTTTTCT GATCCTGCAT CTGGTCACGG TCGCGCTCAG 240 
CCTGTCTACC TGCAGCACAC TCGATATGGA CCAGTTCATG CGCAAGAGGA TCGAGGCGAT 300 
CCGCGGGCAG ATCCTGAGCA AGCTGAAGCT CACCAGTCCC CCAGAAGACT ATCCTGAGCC 360 
5 CGAGGAAGTC CCCCCGGAGG TGATTTCCAT CTACAACAGC ACCAGGGACT TGCTCCAGGA 420 
GAAGGCGAGC CGGAGGGCGG CCGCCTGCGA GCGCGAGAGG AGCGACGAAG AGTACTACGC 480 
CAAGGAGGTT TACAAAATAG ACATGCCGCC CTTCTTCCCC TCCGAAAATG CCATCCCGCC 540 
CACTTTCTAC AGACCCTACT TCAGAATTGT TCGATTTGAC GTCTCAGCAA TGGAGAAGAA 600 
TGCTTCCAAT TTGGTGAAAG CAGAGTTCAG AGTCTTTCGT TTGCAGAACC CAAAAGCCAG 660 
1 0 AGTGCCTGAA CAACGGATTG AGCTATATCA GATTCTCAAG TCCAAAGATT TAACATCTCC 720 
AACCCAGCGC TACATCGACA GCAAAGTTGT GAAAACAAGA GCAGAAGGCG AATGGCTCTC 780 
CTTCGATGTA ACTGATGCTG TTCATGAATG GCTTCACCAT AAAGACAGGA ACCTGGGATT 840 
TAAAATAAGC TTACACTGTC CCTGCTGCAC TTTTGTACCA TCTAATAATT ACATCATCCC 900 
AAATAAAAGT GAAGAACTAG AAGCAAGATT TGCAGGTATT GATGGCACCT CCACATATAC 960 
1 5 CAGTGGTGAT CAGAAAACTA TAAAGTCCAC TAGGAAAAAA AACAGTGGGA AGACCCCACA 1020 
TCTCCTGCTA ATGTTATTGC CCTCCTACAG ACTTGAGTCA CAACAGACCA ACCGGCGGAA 1080 
GAAGCGTGCT TTGGATGCGG CCTATTGCTT TAGAAATGTG CAGGATAATT GCTGCCTACG 1 140 
TCCACTTTAC ATTGATTTCA AGAGGGATCT AGGGTGGAAA TGGATACACG AACCCAAAGG 1200 
Ms GTACAATGCC AACTTCTGTG CTGGAGCATG CCCGTATTTA TGGAGTTCAG ACACTCAGCA 1260 

f m, 20 CAGCAGGGTC CTG AGCTTAT ATAATACCAT AAATCCAGAA GCATCTGCTT CTCCTTGCTG 1 320 
CGTGTCCCAA GATTTAGAAC CTCTAACCAT TCTCTACTAC ATTGGCAAAA CACCCAAGAT 1380 
f li TGAACAGCTT TCTAATATGA TTGTAAAGTC TTGCAAATGC AGCTAAAATT CTTGGAAAAG 1440 

r£: TGGCAAGACC AAAATGACAA TGATGATGAT AATGATGATG ACGACGACAA CGATGATGCT 1500 

y l - TGTAACAAGA AAACATAAGA GAGCCTTGGT TCATCAGTGT TAAAAAATTT TTGAAAAGGC 1 560 

ffi 25 GGTACTAGTT CAGACACTTT GGAAGTTTGT GTTCTGTTTG TTAAAACTGG CATCTGACAC 1620 
p , AAAAAAAGTT GAAGGCCTTA TTCTACATTT CACCTACTTT GTAAGTGAGA GAGACAAGAA 1 680 

I V GCAAATTTTT TTAAA 

K 2' Seq ID NO: 1 1 6 Protein sequence: 

C-130 Protein Accession #: NP_003229.1 

1 11 21 31 41 51 

G i i i I I i 

= = MHYCVLSAFL ILHLVTVALS LSTCSTLDMD QFMRKRIEAI RGQILSKLKL TSPPEDYPEP 60 

^" 3 5 EEVPPEVISI YNSTRDLLQE KASRRAAACE RERSDEEYYA KEVYKIDMPP FFPSENAIPP 120 

fLl TFYRPYFRIV RFDVSAMEKN ASNLVKAEFR VFRLQNPKAR VPEQRIELYQ ILKSKDLTSP 180 

* a: TQRYIDSKVV KTRAEGEWLS FDVTDAVHEW LHHKDRNLGF KISLHCPCCT FVPSNNYIIP 240 

=¥■ NKSEELEARF AGIDGTSTYT SGDQKTKST RKKNSGKTPH LLLMLLPSYR LESQQTNRRK 300 

fh KRALDAAYCF RNVQDNCCLR PLYIDFKRDL GWKWIHEPKG YNANFCAGAC PYLWSSDTQH 360 

5": 40 SRVLSLYNTI NPEASASPCC VSQDLEPLTI LYYIGKTPKI EQLSNMIVKS CKCS 

s y 

Seq ID NO: 1 17 DNA sequence 

Nucleic Acid Accession #: NM_000095. 1 

Coding sequence: 26-2299 (underlined sequences correspond to start and stop codons) 

45 

1 11 21 31 41 51 
I I I I I I 

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60 
CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

50 GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCG CTG CAGGACGTGC GGGACTGGCT 180 
GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 
CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 
GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 
CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

5 5 CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 
GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 
GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 
CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 
CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

60 CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 
GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 
AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 
CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 
CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 

65 GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 
GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 
CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 
CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260 
GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

70 CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 
TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 
CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 
GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 
GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

75 GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 
GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 

400 
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GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860 
CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 
GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 
CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 
J AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 
GG GCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 
CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 
CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 
TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 
1 U GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 
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SeqIDNO: 1 1 8 Protein sequence: 
Protein Accession #: NP 000086. 1 

1 11 21 31 41 51 



i i i i i I 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 
y 0 _ FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPG V ACIQTESGGR CGPCPAGFTG 1 20 
J.~20 NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQG VGLAF AKANKQVCTD 180 
Ci INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 

~, ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 

fe= : EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360 

Lfk c DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 

VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 
W RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKWDKIDVC PENAEVTLTD FRAFQTVVLD 540 

f | j PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600 

!*\ FGYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660 

%L a ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN WLDTTMRGG 720 
f-p U RLG VFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA 



Seq ID NO: 119 DNA sequence 

Nucleic Acid Accession #: NM_014211 

Coding sequence: 157-1479 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

tl I i I I I I 

JE GGGACAGGGC TGAGGATGAG GAGAACCCTG GGGACCCAGA AGACCGTGCC TTGCCCGGAA 60 

2r. . A GTCCTGCCTG TAGGCCTGAA GGACTTGCCC TAACAGAGCC TCAACAACTA CCTGGTGATT 120 
%J\\J CCTACTTCAG CCCCTTGGTG TG AGCAGCTT CTCAACATGA ACTACAGCCT CCACTTGGCC 1 80 
p I TTCGTGTGTC TGAGTCTCTT CACTGAGAGG ATGTGCATCC AGGGGAGTCA GTTCAACGTC 240 

! G AGGTCGGCA G AAGTGACAA GCTTTCCCT G CCTGGCTTTG AGAACCTCAC AGCAGGATAT 300 

AACAAATTTC TCAGGCCCAA TTTTGGTGGA GAACCCGTAC AGATAGCGCT GACTCTGGAC 360 
ATTGCAAGTA TCTCTAGCAT TTCAGAGAGT AACATGGACT ACACAGCCAC CATATACCTC 420 
4D CGACAGCGCT GGATGGACCA GCGGCTGGTG TTTGAAGGCA ACAAGAGCTT CACTCTGGAT 480 
GCCCGCCTCG TGGAGTTCCT CTGGGTGCCA GATACTTACA TTGTGGAGTC CAAGAAGTCC 540 
TTCCTCCATG AAGTCACTGT GGGAAACAGG CTCATCCGCC TCTTCTCCAA TGGCACGGTC 600 
CTGTATGCCC TCAGAATCAC GACAACTGTT GCATGTAACA TGGATCTGTC TAAATACCCC 660 
ATGG ACACAC AGACATGCAA GTTGCAGCTG G AAAGCTGGG GCTATG ATGG AAATG ATGTG 720 
J U G AGTTCACCT GGCTGAGAGG GAACGACTCT GTGCGTGGAC TGGAACACCT GCGGCTTGCT 780 
CAGTACACCA TAGAGCGGTA TTTCACCTTA GTCACCAGAT CGCAGCAGGA GACAGGAAAT 840 
TACACTAGAT TGGTCTTACA GTTTGAGCTT CGGAGGAATG TTCTGTATTT CATTTTGGAA 900 
ACCTACGTTC CTTCCACTTT CCTGGTGGTG TTGTCCTGGG TTTCATTTTG GATCTCTCTC 960 
GATTCAGTCC CTGC AAGAAC CTGCATTGGA GTGACGACCG TGTTATC AAT GACCACACTG 1 020 
J J ATGATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCAAGGC CATCGATGTG 1080 
TACCTGGGGA TCTGCTTTAG CTTTGTGTTT GGGGCCTTGC TAGAATATGC AGTTGCTCAC 1 140 
TACAGTTCCT TACAGCAGAT GGCAGCCAAA GATAGGGGGA CAACAAAGGA AGTAGAAGAA 1200 
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAACGGAA GATCAGCTTT 1260 
GCCAGCATTG AAATTTCCAG CGACAACGTT GACTACAGTG ACTTGACAAT GAAAACCAGC 1320 
OU GACAAGTTCA AGTTTGTCTT CCGAGAAAAG ATGGGCAGGA TTGTTGATTA TTTCACAATT 1380 
CAAAACCCCA GTAATGTTGA TCACTATTCC AAACTACTGT TTCCTTTGAT TTTTATGCTA 1440 
GCCAATGTAT TTTACTGGGC ATACTACATG TATTT TTGAG TCAATGTTAA ATTTCTTGCA 1500 
TGCCATAGGT CTTCAACAGG ACAAGATAAT GATGTAAATG GTATTTTAGG CCAAGTGTGC 1560 
ACCCACATCC AATGGTGCTA CAAGTGACTG AAATAATATT TGAGTCTTTC TGCTCAAAGA 1620 
02> ATGAAGCTCC AACCATTGTT CTAAGCTGTG TAGAAGTCCT AGCATTATAG GATCTTGTAA 1680 
TAGAAACATC AGTCCATTCC TCTTTCATCT TAATCAAGGA CATTCCCATG GAGCCCAAGA 1740 
TTACAAATGT ACTCAGGGCT GTTTATTCGG TGGCTCCCTG GTTTGCATTT ACCTCATATA 1800 
AAGAATGGGA AGGAGACCAT TGGGTAACCC TCAAGTGTCA GAAGTTGTTT CTAAAGTAAC 1860 
_ TATACATGTT TTTTACTAAA TCTCTGCAGT GCTTATAAAA TACATTGTTG CCTATTTAGG 1920 

/ U GAGTAACATT TTCTAGTTTT TGTTTCTGGT TAAAATGAAA TATGGGCTTA TGTCAATTCA 1 980 
TTGGAAGTCA ATGCACTAAC TCAATACCAA GATGAGTTTT TAAATAATGA ATATTATTTA 2040 
ATACCACAAC AGAATTATCC CCAATTTCCA ATAAGTCCTA TCATTGAAAA TTCAAATATA 2100 
AGTGAAGAAA AAATTAGTAG ATCAACAATC TAAACAAATC CCTCGGTTCT AAGATACAAT 2160 
GGATTCCCCA TACTGGAAGG ACTCTG AGGC TTTATTCCCC CACTATGCAT ATCTTATCAT 2220 
/ J TTTATTATTA TACACACATC CATCCTAAAC TATACTAAAG CCCTTTTCCC ATGCATGGAT 2280 
GGAAATGGAA GATTTTTTTG TAACTTGTTC TAGAAGTCTT AATATGGGCT GTTGCCATGA 2340 
AGGCTTGCAG AATTGAGTCC ATTTTCTAGC TGCCTTTATT CACATAGTGA TGGGGTACTA 2400 
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AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460 
GAGCAACACT CTCCCAGTGG CAGATCCCCT GTATCATTCC AAGAGGAGCA TTCATCCCTT 2520 
TGCTCTAATG ATCAGGAATG ATGCTTATTA GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 
GGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTGA TCCTTCCAGC TGGTCTGCTC 2640 
J TGAGTGGCTT ATCCCGCATG AGCAGGAGCG TGCTGGCCCT GAGTACTGAA CTTTCTGAGT 2700 
AACAATGAGA CACGTTACAG AACCTATGTT CAGGTTGCGG GTGAGCTGCC CTCTCCAAAT 2760 
CCAGCCAGAG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GTGATTTTTG 2820 
AGTGTGCCAG GGTAAAGGCT TCCAGTTCAG CCTCAGTTAT TTTAGACAAT CTCGCCATCT 2880 
TTAATTTCTT AGCTTCCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAGAAATAA 2940 

1 U ACCAA GGCTC TAAAAGATGA TTTCCCTTCT GTAACTCCCT AGAGCCACAG GTTCTCATTC 3000 
CTTTTCCCAT TATACTTCTC ACAATTCAGT TTCTATGAGT TTGATCACCT GATTTTTTTA 3060 
ACAAAATATT TCTAACGGGA ATGGGTGGGA GTGCTGGTGA AAAGAGATGA AATGTGGTTG 3120 
TATGAGCCAA TCATATTTGT GATTTTTTAA AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 3180 
AACCCCACTT AAGCATTGTT TTTATATAAA AACAATGATA AAGATGTGAA CTGTGAAATA 3240 

I J AATATACCAT ATTAGCTACC CACC 

Seq ID NO: 120 Protein sequence: 
. Protein Accession #: NP 055026.1 

J==20 1 11 21 31 41 51 

S I I I I I I 

;„.! MNYSLHLAFV CLSLFTERMC IQGSQFNVEV GRSDKLSLPG FENLTAGYNK FLRPNFGGEP 60 

i r) VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLWVPDT 120 

ir I YIVESKKSFL HEVTVGNRLI RLFSNGTVLY ALRITTTVAC NMDLSKYPMD TQTCKLQLES 180 

rnZD WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY TIERYFTLVT RSQQETGNYT RLVLQFELRR 240 

i= : NVLYFILETY VPSTFLVVLS WVSFWISLDS VPARTCIGVT TVLSMTTLMI GSRTSLPNTN 300 

J CFIKAIDVYL GICFSFVFGA LLEYAVAHYS SLQQMAAKDR GTTKEVEEVS ITNIINSSIS 360 

% \ SFKRK1SFAS IEISSDNVDY SDLTMKTSDK FKF VFREKMG RIVDYFTIQN PSNVDHYSKL 420 

f—i-jQ LFPLIFMLAN VFYWAYYMYF 

£ Seq ID NO : 1 2 1 DN A sequence 

t_ Nucleic Acid Accession #: NM_0018S4 

t._i Coding sequence: 163-5582 (underlined sequences correspond to start and stop codons) 

;l35 1 11 21 31 41 51 

fy 1 1 1 1 1 1 

£ ■■ AACCATCAAA TTTAGAAGAA AAAGCCCTTT GACTTTTTCC CCCTCTCCCT CCCCAATGGC 60 

TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 120 
EZL n TTCGTGGGTT GAGTTCACAG TTGTGAGTGC GGGGCTCGGA GATGGAGCCG TGGTCCTCTA 180 
i f+U GGTGGAAAAC GAAACGGTGG CTCTGGGATT TCACCGTAAC AACCCTCGCA TTGACCTTCC 240 
1 y TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300 

TTCACAATTC TCCAGAGGGA ATATCAAAAA CAACGGGATT TTGCACAAAC AGAAAGAATT 360 
CTAAAGGCTC AGATACTGCT TACAGAGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420 
AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 

4D CAAAAAAAGG AATTCAGTCT TTCCTTTTAT CTATATATAA TGAGCATGGT ATTCAGCAAA 540 
TTGGTGTTGA GGTTGGGAGA TCACCTGTTT TTCTGTTTGA AGACCACACT GGAAAACCTG 600 
CCCCAGAAGA CTATCCCCTC TTCAGAACTG TTAACATCGC TGACGGGAAG TGGCATCGGG 660 
TAGCAATCAG CGTGGAGAAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAGAAAACCA 720 
CGAAACCACT TGATAGAAGT GAGAGAGCAA TTGTTGATAC CAATGGAATC ACGGTTTTTG 780 

DU GAACAAGGAT TTTGGATGAA GAAGTTTTTG AGGGGGACAT TCAGCAGTTT TTGATCACAG 840 
GTGATCCCAA GGCAGCATAT GACTACTGTG AGCATTATAG TCCAGACTGT GACTCTTCAG 900 
CACCCAAGGC TGCTCAAGCT CAGGAACCTC AGATAGATGA GTATGCACCA GAGGATATAA 960 
TCGAATATGA CTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTGAAAGT GTAACAGAGG 1020 
GACCCACTGT AACTGAGGAG ACAATAGCAC AGACGGAGGC AAACATCGTT GATGATTTTC 1080 

J J AAGAATACAA CTATGGAACA ATGGAAAGTT ACCAGACAGA AGCTCCTAGG CATGTTTCTG 1140 
GGACAAATGA GCCAAATCCA GTTGAAGAAA TATTTACTGA AGAATATCTA ACGGGAGAGG 1200 
ATTATGATTC CCAGAGGAAA AATTCTGAGG ATACACTATA TGAAAACAAA GAAATAGACG 1260 
GCAGGGATTC TGATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 
AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATGAAGA ATTTGGTCCA GGTGTACCAG 1380 

OU CAGAAACTGA TATTACAGAA ACAAGCATAA ATGGCCATGG TG CATATGG A GAGAAAGGAC 1440 

AGAAAGGAGA ACCAGCAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CAGGACCTGC AGGTATTATG GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG 1560 
ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 
GTCCTCCTGG TACTATGTTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGGAC 1680 

OD CAACCATCTC TGCTCAGGAA GCTCAGGCTC AAGCTATTCT TCAGCAGGCT CGGATTGCTC 1 740 

TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA CTGGAAGACC AGGTCCTGTG GGGGGGCCTG 1800 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 1860 
AGGGTCCCCC TGGTCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 
GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980 

/ U GTCTGCCAGG TGACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040 

CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100 
AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220 
CTGGGCCTCC AGGTCAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 2280 

/ D CAATTGGTCC TCCTGGTGAA AAAGGACCAC AAGGAAAACC AGGACTTGCT GGACTTCCTG 2340 

GTGCTGATGG GCCTCCTGGT CATCCTGGGA AAGAAGGCCA GTCTGGAGAA AAGGGGGCTC 2400 
TGGGTCCCCC TGGTCCACAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAG 2460 
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CAGATGGTGT CAGAGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAGAT GGTTTTCCAG 2520 
GATTCAAAGG TGACATGGGT CTAAAAGGTG ACAGAGGAGA AGTTGGTCAA ATTGGCCCAA 2580 
GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG CCCAACTGGA GACCCAGGTC 2640 
CTTCAGGTCA AGCAGGAGAA AAGGGAAAAC TTGGAGTTCC AGGATTACCA GGATATCCAG 2700 
J GAAGACAAGG TCCAAAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGGAGAGA 2760 

AAGGTGCACG GGGAGTAGCT GGCAAACCAG GCCCTCGGGG TCAGCGTGGT CCAACGGGTC 2820 
CTCGAGGTTC AAGAGGTGCA AGAGGTCCCA CTGGGAAACC TGGGCCAAAG GGCACTTCAG 2880 
GTGGCGATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG 2940 
TTGGATTCCC TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TGCCCAGGAC 3000 
1 V ACCCTGGGCA ACGTGGGGAG ACTGGATTTC AAGGCAAGAC CGGCCCTCCT GGGCCAGGGG 3060 
GAGTGGTTGG ACCACAGGGA CCAACCGGTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3120 
CTGGTCCTCC TGGCCCTCCT GGTGAGCAAG GTCTTCCTGG TGCTGCAGGA AAAGAAGGTG 3180 
CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG 3240 
GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 
1 D AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 3360 

CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTCCTCCT GGTCCAGCTG 3420 
GAGAGAAAGG TGCTCCTGGA GAAAAAGGTC CCCAAGGGCC TGCAGGGAGA GATGGAGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGGAG 3540 
;; - ACAAGGGTGA AATTG GTG AG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 

~~Z() GCCCTCCCGG TCCCCCAGGT CTTCAAGGAC CAGTTGGTGC CCCTGGAATT GCTGGAGGTG 3660 

O ATGGTGAACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 3720 

CCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCCCACCTG 3780 
fe=- GTGAAAAAGG TGAAAATGGG GATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 3840 

IfL GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACCCCCAGGT TCTGTTGGTT 3900 

f „-rZD CAGTTGGTGG TGTTGGAGAA AAGGGTGAAC CTGGAGAAGC AGGAAACCCA GGGCCTCCTG 3960 

W GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGGAA GCTGGTCCAC 4020 

Rj CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATGATGGC CCTAAGGGTA 4080 

L " : ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGGAACTT GGCCCTGCAG 4140 

'% A GTCAAGATGG TGTTGGTGGT GACAAGGGTG AAGATGGAGA TCCTGGTCAA CCGGGTCCTC 4200 
f=pU CTGGCCCATC TGGTGAGGCT GGCCCACCAG GTCCTCCTGG AAAACGAGGT CCTCCTGGAG 4260 
" CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 

* GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 4380 

f==: AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 

rV, <- AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 

h*S J GCTCCAAGGG TG A AAAGGG A CATCCTGGTT TAATTGGCCT GATTGGTCCT CCAGGAGAAC 4560 
f? i AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 

5 ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680 

£] GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGGACC CGCTGGCCAG AAAGGTGACA 4740 

GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT CAGCCTTTAC 4800 
fe:4U CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860 

f|j ATAATATTCT TGATTACTCG GATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920 

AACAAGACAT CGAGCATATG AAATTTCC A A TGGGTACTCA GACCAATCCA GCCCGAACTT 4980 
GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA 5040 
ACCAAGGTTG CTCAGGAGAT TCCTTCAAAG TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100 
40 CTTGCATTTA TCCAGACAAA AAATCTGAGG GAGTAAGAAT TTCATCATGG CCAAAGGAGA 5160 

AACCAGGAAG TTGGTTTAGT GAATTTAAGA GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 
AAGGAAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 5280 
GGCAAAATTT CACCTACCAC TGTCATCAGT CAGCAGCCTG GTATGATGTG TCATCAGGAA 5340 
GTTATGACAA AGCACTTCGC TTCCTGGGAT CAAATGATGA GGAGATGTCC TATGACA ATA 5400 
J U ATCCTTTTAT CAAAAC ACTG TATGATGGTT GTACGTCCAG AAAAGGCTAT GAAAA AACTG 5460 

TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 
ACTTTGGTGA TCAGAATCAG AAGTTCGGAT TTGAAGTTGG TCCTGTTTGT TTTCTTGGCJ_ 5580 
AAGATTAAGA CAAAGAACAT ATCAAATCAA CAGAAAATGT ACCTTGGTGC CACCAACCCA 5640 
TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA AACAACGCTG CATATACAGG 5700 
J J TACCATTTAG GAAATACCGA TGCCTTTGTG GGGGCAGAAT CACAGACAAA AGCTTTGAAA 5760 

ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TGATTCCCAA 5820 
TTCTCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GTAGAAAACA AAAAAAGAAA 5880 
AATATATATT CATAAAAAAT ATGGTGCTCA TTCTCATCCA TCCAGGATGT ACTAAAACAG 5940 
TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 
OU CCAAAACTTG CACGTGTCCC TGAATTCCGC TGACTCTAAT TTATGAGGAT GCCGAACTCT 6060 

GATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTAAGTCCC 6120 
TTTCTTTGGT TAATGATGAA ATTCCTTTGT GTGTGTTT 

Seq ID NO: 122 Protein sequence: 
65 Protein Accession #: NP 00184S 
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IK) MEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF 60 
CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFLLSIYN 120 
EHGIQQIG VE VGRSP VFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKT VTMI V 1 80 
DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFLITGDPK AAYDYCEHYS 240 
PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 300 

/ J NIVDDFQEYN YGTMESYQTE APRHVSGTNE PNPVEEIFTE EYLTGEDYDS QRKNSEDTLY 360 
ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG 420 
AYGEKGQKGE PAVVEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP 480 

403 



GADGLPGPPG TMLM LPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRGP PGPMGLTGRP 540 
GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 
GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP 720 
GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGXPG PRGVKGADGV RGLKGSKGEK 780 
GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP 840 
GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 
GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 
GPPGPGGVVG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020 
GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1080 
GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK 1 140 
GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 
GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 
GNPGPPGEAG VGGPKGERGE KGEAGPPG A A GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 
GELGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK 1380 
GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP 1440 
GLKGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500 
GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 
MQADADDNIL DYSDGMEEIF GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1620 
EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680 
LSYLDVEGNS INMVQMTFLK LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740 
EMSYDNNPFI KTLYDGCTSR KGYEKTVTEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1 800 
PVCFLG 

Seq ID NO: 123 DNA sequence 

Nucleic Acid Accession #: NM 015886 

Coding sequence: 485-1 261 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
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GAATTCCCCC CCCCCCCCCC TCACTTGGTG TGTCTATATG TCTGGCAGAC ATTATCAGCA 60 
CATTCTCTGT TGTTACCTGT GATTCATTTT TTCTTCACTC TCCAGGTGAA TTTCAATTGC 120 
TGAAAATTTC CCACTGAAAA TATGCAGTAA TATATTTTGT GGTTCAGACA TTTGGGGCAA 1 80 
ATGGTTCACA TTCATTTTAG GGTTAGTGGT CATGCTGTTT ATTTTTCTCT GCTATACAAA 240 
GTTCCTCTTA GGGGTCTGCC TCATGACACT AAAAAATGAA TAGAGATTCT ACTGTAGGTT 300 
ATCTCCTAGG CTTGAGTTCA ACATTTGTTT GGATTTTTGA AGAAAGTCAA ATCAAGCAAT 360 
GCTCCCAAAT GATGTCTTTG TAAATTCATA CCCTCTGGCC CTATTTTTTT TCATAGACCC 420 
TAACTCTACC TTTCTGCTTT AAAGCAAAGT AAACTCGGTG GCCTCTTCTT CTCCACCCCT 480 
CAAA ATG ATA GCAATCTCTG CCGTCAGCAG TGCACTCCTG TTCTCCCTTC TCTGTGAAGC 540 
AAGTACCGTC GTCCTACTCA ATTCCACTGA CTCATCCCCG CCAACCAATA ATTTCACTGA 600 
TATTGAAGCA GCTCTGAAAG CACAATTAGA TTCAGCGGAT ATCCCCAAAG CCAGGCGGAA 660 
GCGCTACATT TCGCAGAATG ACATGATCGC CATTCTTGAT TATCATAATC AAGTTCGGGG 720 
CAAAGTGTTC CCACCGGCAG CAAATATGGA ATATATGGTT TGGGATGAAA ATCTTGCAAA 780 
ATCGGCAGAG GCTTGGGCGG CTACTTGCAT TTGGGACCAT GGACCTTCTT ACTTACTGAG 840 
ATTTTTGGGC CAAAATCTAT CTGTACGCAC TGGAAGATAT CGCTCTATTC TCCAGTTGGT 900 
CAAGCCATGG TATGATGAAG TGAAAGATTA TGCTTTTCCA TATCCCCAGG ATTGCAACCC 960 
CAGATGTCCT ATGAGATGTT TTGGTCCCAT GTGCACACAT TATACGCAGA TGGTTTGGGC 1020 
CACTTCCAAT CGGATAGGAT GCGCAATTCA TGCTTGCCAA AACATGAATG TTTGGGGATC 1080 
TGTGTGGCGA CGTGCAGTTT ACTTGGTATG CAACTATGCC CCAAAGGGCA ATTGGATTGG 1 140 
AGAAGCACCA TATAAAGTAG GGGTACCATG TTCATCTTGT CCTCCAAGTT ATGGGGGATC 1200 
TTGTACTGAC AATCTGTGTT TTCCAGGAGT TACGTCAAAC TACCTGTACT GGTTTAAATA 1260 
AGTTTACCTT TTCCTCCAGG AAATATAATG ATTTCTGGGA ACATGGGCAT GTATATATAT 1320 
ATATGGAGAG AGAATTTTGC ACATATTATA CATATTTTGT GCTAATCTTG TTTTCCTCTT 1380 
AGTATTCCTT TGTATAAATT AGTGTTTGTC TAGCATGTTT GTTTAATCCT TTGGGAATTC 

Seq ID NO: 124 Protein sequence: 
Protein Accession #: NP_056970.1 

1 11 21 31 41 51 

I I I I I I 

MIAISAVSSA LLFSLLCEAS TVVLLNSTDS SPPTNNFTDI EAALKAQLDS ADIPKARRKR 60 
YISQNDMIAI LDYHNQVRGK VFPPAANMEY MVWDENLAKS AEAWAATCIW DHGPSYLLRF 120 
LGQNLSVRTG RYRSILQLVK PWYDEVKDYA FPYPQDCNPR CPMRCFGPMC THYTQM VWAT 1 80 
SNRIGCAIHA CQNMNVWGSV WRRAVYLVCN YAPKGNWIGE APYKVGVPCS SCPPSYGGSC 240 
TDNLCFPGVT SNYLYWFK 



Seq ID NO: 125 DNA sequence 

Nucleic Acid Accession #: NM O01793 

Coding sequence: 54-2543 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

I I ! I I I 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATGGGGC 60 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 
CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 
CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 

404 
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CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG G A ATGGCG AG ACAGTCCAGG 300 
A A AGAAGG TC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 
GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420 
CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 
5 ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 
TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 
TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 
GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 
1 0 ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 
AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 
CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 
1 5 GCCATG AGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1 140 
GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 
AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 
- CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 

H20 CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 
p. CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 

]£' CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 

V GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 

[PL ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

„..25 ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 
Lit ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 

fi i TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 

= fei TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 

Mi n ACCATGGCAA CAAAGAGCAG CTG ACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1 980 
; =3 0 ATGTCGAAAC CTGCCCTGGA CCCTGG AAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 
W=" TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 

TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 
I. AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 

M, <- AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 
i. =3 5 TGTACCGTCC TAGGCCAGCC AACCCAG ATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 
II AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 

I IJ AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 

_F ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 

~a r\ ACGGTGGCGG GGAGGACGAC TAG GCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 
C:40 CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 
== ; GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 

! y TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 

CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 
TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 
45 CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 

Seq ID NO: 126 Protein sequence: 
Protein Accession #: NP 001 784 

1 11 21 31 41 51 

55 i | | i i i 

MGLPRGPLAS LIXLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEI AK 1 80 
YELFGHAVSE NGASVEDPMN ISirVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

60 DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 
PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

65 VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 
TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RK1KEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 
GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFHE NLKAANTDPT APPYDXLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Seq ID NO: 127 DNA sequence 

Nucleic Acid Accession #: NM_003256. 1 

Coding sequence: 60-734 (underlined sequences correspond to start and stop codons) 

75 1 11 21 31 41 51 

I I I I I I 

CCTGCTGGGG CCGTCCAGTC CCCCAGACCT CACAGGCTCA GTCGCGGATC TGCAGTGTCA 60 

405 
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TGCCTGGGAG CCCTCGGCCC GCGCCAAGCT GGGTGCTGTT GCTGCGGCTG CTGGCGTTGC 1 20 
TGCGGCCCCC GGGGCTGGGT G AGGCATGCA GCTGCGCCCC GGCGCACCCT CAGCAGCACA 1 80 
TCTGCCACTC GGCACTTGTG ATTCGGGCCA AAATCTCCAG TGAGAAGGTA GTTCCGGCCA 240 
GTGCAGACCC TGCTGACACT GAAAAAATGC TCCGGTATGA AATCAAACAG ATAAAGATGT 300 
5 TCAAAGGGTT TGAGAAAGTC AAGGATGTTC AGTATATCTA TACGCCTTTT GACTCTTCCC 360 
TCTGTGGTGT GAAACTAGAA GCCAACAGCC AGAAGCAGTA TCTCTTGACT GGTCAGGTCC 420 
TCAGTGATGG AAAAGTCTTC ATCCATCTGT GCAACTACAT CGAGCCCTGG GAGGACCTGT 480 
CCTTGGTGCA GAGGGAAAGT CTGAATCATC ACTACCATCT GAACTGTGGC TGCCAAATCA 540 
CCACCTGCTA CACAGTACCC TGTACCATCT CGGCCCCTAA CGAGTGCCTC TGGACAGACT 600 
1 0 GGCTGTTGGA ACGAAAGCTC TATGGTTACC AGGCTCAGCA TTATGTCTGT ATGAAGCATG 660 
TTGACGGCAC CTGCAGCTGG TACCGGGGCC ACCTGCCTCT CAGGAAGGAG TTTGTTGACA 720 
TCGTTCAGCC C TAG TAGGGA CCAGTGACCA TCACATCCCT TCAAGAGTCC TGAAGATCAA 780 
GCCAGTTCTC CTTCCCTGCA GAGCTTTGGC CATTACCACC TGACCTCTTG CTGCCAGCTA 840 
ATAAGAAGTG CCAAGTGGAC AGTCTGGCCA CTGTCAAGGC AGGGAAGGGG CCATGACTTT 900 
1 5 TCTGCCCTGC CCTCAGCCTG TTGCCCTGCC TCCCAAACCC CATTAGTCTA GCCTTGTAGC 960 
TGTTACTGCA AGTGTTTCTT CTGGCTTAGT CTGTTTTCTA AAGCCAGGAC TATTCCCTTT 1020 
CCTCCCCAGG AATATGTGTT TTCCTTTGTC TTAATCGATC TGGTAGGGGA GAAATGGCGA 1080 
ATGTCATACA CATGAGATGG TATATCCTTG CGATGTACAG AATCAGAAGG TGGTTTGACA 1140 
GCATCATAAA CAGGCTGACT GGCAGGAATG AAAAAAAAAA AAAAAAAAA 
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Seq ID NO: 1 28 Protein sequence: 
Protein Accession #: NP 003247.1 



" 1 11 21 31 41 51 

! 25 | | | | | | 

f • MPGSPRPAPS WVLLLRLLAL LRPPGLGEAC SCAPAHPQQH ICHSALVIRA KISSEKVVPA 60 

J-! SADPADTEKM LRYEKQIKM FKGFEKVKDV QYIYTPFDSS LCGVKLEANS QKQYLLTGQV 120 

U LSDGKVFIHL CNYIEPWEDL SLVQRESLNH HYHLNCGCQI TTCYTVPCTI SAPNECLWTD 1 80 

WLLERKLYGY QAQHYVCMKH VDGTCSWYRG HLPLRKEFVD IVQP 



Seq ID NO: 129 DNA sequence 

Nucleic Acid Accession #: NM_007207.2 

Coding sequence: 143-1591 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CCACGCGTCC GCAATGAAGC CGAGTGAATG GGGGCTGAAT GTGCGAGTCC ATAGCTGAAG 60 
!: AGGAGCGCCA GATGGTGGAG GAATACACTT ATTTATGAAA CTGTCTTGAG TTCTTCTTGA 120 

=' ATTGCCAGTT TTCAGCCTCC TC ATG CCTCC GTCTCCTTTA GACGACAGGG TAGTAGTGGC 180 

40 ACTATCTAGG CCCGTCCGAC CTCAGGATCT CAACCTTTGT TTAGACTCTA GTTACCTTGG 240 
: ; CTCTGCCAAC CCAGGCAGTA ACAGCCACCC TCCTGTCATC GCCACCACCG TTGTGTCCCT 300 

J CAAGGCTGCG AATCTGACGT ATATGCCCTC ATCCAGCGGC TCTGCCCGCT CGCTGAATTG 360 

TGGATGCAGC AGTGCCAGCT GCTGCACTGT GGCAACCTAC GACAAGGACA ATCAGGCCCA 420 
AACCCAAGCC ATTGCCGCTG GCACCACCAC CACTGCCATC GGAACCTCTA CCACCTGCCC 480 
45 TGCTAACCAG ATGGTCAACA ATAATGAGAA TACAGGCTCT CTAAGTCCAT CAAGTGGGGT 540 
GGGCAGCCCT GTGTCAGGGA CCCCCAAGCA GCTAGCCAGC ATCAAAATAA TCTACCCCAA 600 
TGACTTGGCA AAGAAGATGA CCAAATGCAG CAAGAGTCAC CTGCCGAGTC AGGGCCCTGT 660 
CATCATTGAC TGCAGGCCCT TCATGGAGTA CAACAAGAGT CACATCCAAG GAGCTGTCCA 720 
CATTAACTGT GCCGATAAGA TCAGCCGGCG GAGACTGCAG CAGGGCAAGA TCACTGTCCT 780 
AGACTTGATT TCCTGTAGGG AAGGCAAGGA CTCTTTCAAG AGGATCTTTT CCAAAGAAAT 840 
TATAGTTTAT GATGAGAATA CCAATGAACC AAGCCGAGTG ATGCCCTCCC AGCCACTTCA 900 
CATAGTCCTC GAGTCCCTGA AGAGAGAAGG CAAAGAACCT CTGGTGTTGA AAGGTGGACT 960 
TAGTAGTTTT AAGCAGAACC ATGAAAACCT CTGTGACAAC TCCCTCCAGC TCCAAGAGTG 1020 
CCGGGAGGTG GGGGGCGGCG CATCCGCGGC CTCGAGCTTG CTACCTCAGC CCATCCCCAC 1080 

5 5 CACCCCTGAC ATCGAGAACG CTGAGCTCAC CCCCATCTTG CCCTTCCTGT TCCTTGGCAA 1 140 

TGAGCAGGAT GCTCAGGACC TGGACACCAT GCAGCGGCTG AACATCGGCT ACGTCATCAA 1200 
CGTCACCACT CATCTTCCCC TCTACCACTA TGAGAAAGGC CTGTTCAACT ACAAGCGGCT 1260 
GCCAGCCACT GACAGCAACA AGCAGAACCT GCGGCAGTAC TTTGAAGAGG CTTTTGAGTT 1320 
CATTGAGGAA GCTCACCAGT GTGGGAAGGG GCTTCTCATC CACTGCCAGG CTGGGGTGTC 1380 
60 CCGCTCCGCC ACCATCGTCA TCGCTT ACTT GATGAAGCAC ACTCGGATGA CCATGACTGA 1 440 
TGCTTATAAA TTTGTCAAAG GCAAACGACC AATTATCTCC CCAAACCTTA ACTTCATGGG 1500 
GCAGTTGCTA GAGTTCGAGG AAGACCTAAA CAACGGTGTG ACACCGAGAA TCCTTACACC 1 560 
AAAGCTGATG GGCGTGGAGA CGGTTGTG TG A CAATGGTCT GGATGGAAAG GATTGCTGCT 1620 
CTCCATTAGG AGACAATGAG GAAGGAGGAT GGATTCTGGT TTTTTTTCTT TCTTTTTTTT 1 680 

6 5 TTGTAGTTGG GAGTAAAGTT TGTGAATGG A AACAAACTTG GTTAAACACT TTATTTTTAA 1 740 

CAAGTGTAAG AAGACTATAC TTTTGATGCC ATTGAGATTC ACCTTCCACA AACTGGCCAA 1 800 
ATTAAGGAGG TTAAAGAAGT AATTTTTTTT AAGCCCAACC ATTAAAAATT TAATACAACT 1860 
TGGTTTCTCC CCCTTTTTCC TTTAAAGCTA NTTTGTAAAA GTTTATGAG 

70 Seq ID NO: 130 Protein sequence: 

Protein Accession #: NP_009138.1 

1 11 21 31 41 51 

I I 1 I I I 

MPPSPLDDRV VVALSRPVRP QDLNLCLDSS YLGSANPGSN SHPPVIATTV VSLKAANLTY 60 
MPSSSGSARS LNCGCSSASC CTVATYDKDN QAQTQAIAAG TTTTAIGTST TCPANQMVNN 1 20 
NENTGSLSPS SGVGSPVSGT PKQLASKII YPNDLAKKMT KCSKSHLPSQ GPVIIDCRPF 1 80 

406 



MEYNKSHIQG AVHINCADK1 SRRRLQQGKI TVLDLISCRE GKDSFKRIFS KEIIVYDENT 240 
NEPSRVMPSQ PLHIVLESLK REGKEPLVLK GGLSSFKQNH ENLCDNSLQL QECREVGGGA 300 
SAASSLLPQP IPTTPDIENA ELTPILPFLF LGNEQDAQDL DTMQRLNIGY VINVTTHLPL 360 
YHYEKGLFNY KRLPATDSNK QNLRQYFEEA FEFIEEAHQC GKGLLIHCQA GVSRSATIVI 420 
AYLMKHTRMT MTDAYKFVKG KRPIISPNLN FMGQLLEFEE DLNNGVTPRI LTPKLMGVET 480 
VV 

SeqIDNO:131 DNA sequence 

Nucleic Acid Accession #: NM 005409.3 

Coding sequence: 94-378 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 
CAACAGCACC AGCAGCAACA GCAAAAAACA AAC ATG AGTG TGAAGGGCAT GGCTATAGCC 120 
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAG AGGACGC 1 80 
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 
ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 
AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 
GAAAGAAAGA ATTTT TAAA A ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 
AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 
AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 
GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 
CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 
GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 
GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 
GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 
CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 
CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 
CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 
TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 
TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 
GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 
TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 
TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 
AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 
TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

Seq ID NO: 132 Protein sequence: 
Protein Accession #: NP 005400. 1 

1 11 21 31 41 51 

I I I I I I 

MSVKGMAIAL AVILCATVVQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 
EV1ITLKENK GQRCLNPKSK QARLIKKVE RKNF 

Seq ID NO: 133 DNA sequence 

Nucleic Acid Accession #: NM 012342 

Coding sequence: 373-1 1 55 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CTGGCGCGGG CGGGAGCTGC GGCGGATACC CTTGCGTGCT GTGGAGACCC TACTCTCTTC 60 
GCTGAGAACG GCCGCTAGCG GGGACTGAAG GCCGGGAGCC CACTCCCGAC CCGGGGCTAG 120 
CGTGCGTCCC TAGAGTCGAG CGGGGCAAGG GAGCCAGTGG CCGCCGACGG GGGACCGGG A 1 80 
AACTTTTCTG GGCTCCTGGA GAGCCCTGTA GCCGCGCTCC ATGCTCCGGC AGCGGCCCGA 240 
AACCCAGCCC CGCCGCTGAC GGAGCCCGCC GCTCCGGGCA GGGCCCATGC CCTGCGCGCT 300 
CCGGGGGTCG TAGCTGCCGC CGAGCCGGGG CTCCGGAAGC CGGCGGGGGC GCCGCGGCCG 360 
TGCGGGGCGT CA ATG GATCG CCACTCCAGC TACATCTTCA TCTGGCTGCA GCTGGAGCTC 420 
TGCGCCATGG CCGTGCTGCT CACCAAAGGT GAAATTCGAT GCTACTGTGA TGCTGCCCAC 480 
TGTGTAGCCA CTGGTTATAT GTGTAAATCT GAGCTCAGCG CCTGCTTCTC TAGACTTCTT 540 
GATCCTCAGA ACTCAAATTC CCCACTCACC CATGGCTGCC TGGACTCTCT TGCAAGCACG 600 
ACAGACATCT GCCAAGCCAA ACAGGCCCGA AACCACTCTG GCACCACCAT ACCCACATTG 660 
GAATGCTGTC ATGAAGACAT GTGCAATTAC AGAGGGCTGC ACGATGTTCT CTCTCCTCCC 720 
AGGGGTGAGG CCTCAGGACA AGGAAACAGG TATCAGCATG ATGGTAGCAG AAACCTTATC 780 
ACCAAGGTGC AGGAGCTGAC TTCTTCCAAA GAGTTGTGGT TCCGGGCAGC GGTCATTGCC 840 
GTGCCCATTG CTGGAGGGCT GATTTTAGTG TTGCTTATTA TGTTGGCCCT GAGGATGCTT 900 
CGAAGTGAAA ATAAGAGGCT GCAGGATCAG CGGCAACAGA TGCTCTCCCG TTTGCACTAC 960 
AGCTTTCACG GACACCATTC CAAAAAGGGG CAGGTTGCAA AGTTAGACTT GGAATGCATG 1020 
GTGCCGGTCA GTGGGCACGA GAACTGCTGT CTGACCTGTG ATAAAATGAG ACAAGCAGAC 1080 
CTCAGCAACG ATAAGATCCT CTCGCTTGTT CACTGGGGCA TGTACAGTGG GCACGGGAAG 1 140 
CTGGAATTCG TA TGA CGGAG TCTTATCTGA ACTACACTTA CTGAACAGCT TGAAGGCCTT 1200 
TTGAGTTCTG CTGGACAGGA GCACTTTATC TGAAGACAAA CTCATTTAAT CATCTTTGAG 1260 
AG AC A A AATG ACCTCTGCAA ACAGAATCTT GGATATTTCT TCTGAAGGAT TATTTGCACA 1320 
GACTTAAATA CAGTTAAATG TGTTATTTGC TTTTAAAATT ATAAAAAGCA AAGAGAAGAC 1380 
TTTGTACACA CTGTCACCAG GGTTATTTGC ATCCAAGGGA GCTGGAATTG AGTACCTAAA 1440 

407 
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TAAACAAAA A TGTGCCCTAT GTAAGCTTCT ACATCTTGAT TTATTGTAAA GATTTAAAAG 1 500 
AAATATATAT ATTTTGTCTG A 

Seq ID NO: 134 Protein sequence: 
J Protein Accession #: NP 036474.1 

1 11 21 31 41 51 

I I I I I I 

MDRHSSYIFI WLQLELCAMA VLLTKGEIRC YCDAAHCVAT GYMCKSELSA CFSRLLDPQN 60 
1 U SNSPLTHGCL DSLASTTDIC QAKQARNHSG TTIPTLECCH EDMCNYRGLH D VLSPPRGEA 1 20 
SGQGNRYQHD GSRNLITKVQ ELTSSKELWF RAAVIAVPIA GGLILVLLIM LALRMLRSEN 180 
KRLQDQRQQM LSRLHYSFHG HHSKKGQVAK LDLECMVPVS GHENCCLTCD KMRQADLSND 240 
KILSLVHWGM YSGHGKLEFV 

15 Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession #: NM 00 1 627. 1 

Coding sequence: 64-1 815 (underlined sequences correspond to start and stop codons) 
_„ 1 11 21 31 41 51 

M 20 I I I I I I 

CGGGACGACG CCCCCTCCTG CGGCGTGGAC TCCGTCAGTG GCCCACCAAG AAGGAGGAGG 60 
i»i AATATGGAAT CCAAGGGGGC CAGTTCCTGC CGTCTGCTCT TCTGCCTCTT GATCTCCGCC 120 

f»; ACCGTCTTCA GGCCAGGCCT TGGATGGTAT ACTGTAAATT CAGCATATGG AGATACCATT 1 80 

— ATCATACCTT GCCGACTTGA CGTACCTCAG AATCTCATGT TTGGCAAATG GAAATATGAA 240 

y-;ZD AAGCCCGATG GCTCCCCAGT ATTTATTGCC TTCAGATCCT CTACAAAGAA AAGTGTGCAG 300 
3V : TACGACGATG TACCAGAATA CAAAGACAGA TTGAACCTCT CAGAAAACTA CACTTTGTCT 360 

W> ! ATCAGTAATG CAAGGATCAG TGATGAAAAG AGATTTGTGT GCATGCTAGT AACTGAGGAC 420 

ft! AACGTGTTTG AGGCACCTAC AATAGTCAAG GTGTTCAAGC AACCATCTAA ACCTGAAATT 480 

■- 1(\ GTAAGCAAAG CACTGTTTCT CG AAACAGAG CAGCTAAAAA AGTTGGGTGA CTGCATTTCA 540 
> O V GAAGACAGTT ATCCAGATGG CAATATCACA TGGTACAGGA ATGGAAAAGT GCTACATCCC 600 
p|! CTTGAAGGAG CGGTGGTCAT AATTTTTAAA AAGGAAATGG ACCCAGTGAC TCAGCTCTAT 660 

ACCATGACTT CCACCCTGGA GTACAAGACA ACCAAGGCTG ACATACAAAT GCCATTCACC 720 
¥ TGCTCGGTGA CATATTATGG ACCATCTGGC CAGAAAACAA TTCATTCTGA ACAGGCAGTA 780 

3=;_ _ TTTGATATTT ACTATCCTAC AGAGCAG GTG ACAATACAAG TGCTGCCACC AAAAAATGCC 840 
== J D ATCAAAGAAG GGGATAACAT CACTCTTAAA TGCTTAGGGA ATGGCAACCC TCCCCCAGAG 900 
|= GAATTTTTGT TTTACTTACC AGGACAGCCC GAAGGAATAA GAAGCTCAAA TACTTACACA 960 

n ; CTGATGGATG TGAGGCGCA A TGCAACAGG A GACTACAAGT GTTCCCTGAT AGACAAAAAA 1 020 

5 '= : AGCATGATTG CTTCAACAGC CATCACAGTT CACTATTTGG ATTTGTCCTT AAACCCAAGT 1080 

<£> A f\ GGAGAAGTGA CTAGACAGAT TGGTGATGCC CTACCCGTGT CATGCACAAT ATCTGCTAGC 1 140 
«=f*U AGGAATGCAA CTGTGGTATG GATGAAAGAT AACATCAGGC TTCGATCTAG CCCGTCATTT 1200 
W TCTAGTCTTC ATTATCAGGA TGCTGGAAAC TATGTCTGCG AAACTGCTCT GCAGGAGGTT 1260 

H ! GAAGGACTAA AGAAAAGAGA GTCATTGACT CTCATTGTAG AAGGCAAACC TCAAATAAAA 1 320 

ATGACAAAGA AAACTGATCC CAGTGGACTA TCTAAAACAA TAATCTGCCA TGTGGAAGGT 1380 
TTTCCAAAGC CAGCCATTCA GTGGACAATT ACTGGCAGTG GAAGCGTCAT AAACCAAACA 1440 
40 GAGGAATCTC CTTATATTA A TGGCAGGTAT TATAGTAAAA TTATCATTTC CCCTG AAGAG 1 500 
AATGTTACAT TAACTTGCAC AGCAGAAAAC CAACTGGAGA GAACAGTAAA CTCCTTGAAT 1560 
GTCTCTGCTA TAAGTATTCC AGAACACGAT GAGGCAGACG AGATAAGTGA TGAAAACAGA 1620 
GAAAAGGTGA ATGACCAGGC AAAACTAATT GTGGGAATCG TTGTTGGTCT CCTCCTTGCT 1680 
GGCCTTGTTG CTGGTGTCGT CTACTGGCTG TACATGAAGA AGTCAAAGAC TGCATCAAAA 1740 
-> U CATGTAAACA AGGACCTCGG TAATATGGAA GAAAACAAAA AGTTAGAAGA AAACAATCAC 1 800 
AAAACTGAAG GCTAAGAGAG AAACTGTCCT AGTTGTCCAG AGATAAAAAT CATATAGACC 1860 
AATTGAAGCA TGAACGTGGA TTGTATTTAA GACATAAACA AAGACATTGA CAGCAATTCA 1920 
TGGTTCAAGT ATTAAGCAGT TCATTCTACC AAGCTGTCAC AGGTTTTCAG AGAATTATCT 1980 
CAAGTAAAAC AAATGAAATT TAATTACAAA CAATAAGAAC AAGTTTTGGC AGCCATGATA 2040 
J J ATAGGTCATA TGTTGTGTTT GGTTCAATTT TTTTTCCGTA AATGTCTGC A CTGAGGATTT 2 1 00 
CTTTTTGGTT TGCCTTTTAT GTAAATTTTT TACGTAGCTA TTTTTATACA CTGTAAGCTT 2160 
TGTTCTGGGA GTTGCTGTTA ATCTGATGTA TAATGTAATG TTTTTATTTC AATTGTTTAT 2220 
ATGGATAATC TGAGCAGGTA CATTTCTGAT TCTGATTGCT ATCAGCAATG CCCCAAACTT 2280 
TCTCATAAGC ACCTAAAACC CAAAGGTGGC AGCTTGTGAA GATTGGGGAC ACTCATATTG 2340 
OU CCCTAATTAA AAACTGTGAT TTTTATCACA AGGGAGGGGA GGCCGAGAGT CAGACTGATA 2400 
GACACCATAG GAGCCGACTC TTTGATATGC CACCAGCGAA CTCTCAGAAA TAAATCACAG 2460 
ATGCATATAG ACACACATAC ATAATGGTAC TCCCAAACTG ACAATTTTAC CTATTCTGAA 2520 
AAAGACATAA AACAGAATT 

65 Seq ID NO: 136 Protein sequence: 

Protein Accession #: NP 00 1618.1 

1 11 21 31 41 51 

-T/-1 I I I I I I 

/U MESKGASSCR LLFCLLISAT VFRPGLGWYT VNSAYGDTII IPCRLDVPQN LMFGKWKYEK 60 
PDGSPVFIAF RSSTKKSVQY DDVPEYK.DRL NLSENYTLSI SNARISDEKR FVCMLVTEDN 120 
VFEAPTIVKV FKQPSKPEIV SKALFLETEQ LKKLGDCISE DSYPDGNITW YRNGKVLHPL 180 
EGAVVIIFKK EMDPVTQLYT MTSTLEYKTT KADIQMPFTC SVTYYGPSGQ KTIHSEQAVF 240 



DIYYPTEQVT IQVLPPKNAI KEGDNITLKC LGNGNPPPEE FLFYLPGQPE GIRSSNTYTL 300 
/ 0 MDVRRNATGD YKCSLIDKKS MIASTAITVH YLDLSLNPSG EVTRQIGDAL PVSCTISASR 360 
' "~ " " "" " " RESLT 

IYYSK 

408 



... u » u „ w w * * M.M. 1 Wi/UJUlll Li Y i l\yiVJfAL I V llOrtlJIV JUU 

NATVVWMKDN IRLRSSPSFS SLHYQDAGNY VCETALQEVE GLKKRESLTL IVEGKPQIKM 420 
TKKTDPSGLS KTIICHVEGF PKPAIQWT1T GSGSVINQTE ESPYING RYY SKIIISPEEN 480 
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VTLTCTAENQ LERTVNSLNV SAISIPEHDE ADEISDENRE KVNDQAKLIV GIVVGLLLAA 540 
LVAGVVYWLY MKKSKTASKH VNKDLGNMEE NKKLEENNHK TEA 

Seq ID NO: 137 DNA sequence 
Nucleic Acid Accession #: XM 030559 

Coding sequence: 1-119 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGAACCGCA GCCACCGGCA CGGGGCGGGC AGCGGCTGCC TGGGCACTAT GGAGGTGAAG 60 
AGCAAGTTTG GAGCTGAATT TCGTCGGTTT TCGCTGGAAA GATCAAAACC TGGAAAATTT 120 
GAGGAGTTTT ATGGATTACT ACAACATGTT CATAAGATCC CCAATGTTGA CGTTTTGGTA 1 80 
GGCTATGCAG ACATCCATGG AGACTTACTA CCTATAAATA ATGATGATAA TTATCACAAA 240 
1 D GCTGTTTCAA CGGCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAGGA AGAAGCAGAC 300 
TACAGTGCCT TTGGTACAGA CACGCTAATA AAGAAGAAGA ATGTTTTAAC CAACGTATTG 360 
CGTCCTGACA ACCATAGAAA AAAGCCACAT ATAGTCATTA GTATGCCCCA AGACTTTAGA 420 
CCTGTGTCTT CTATTATAGA CGTGGATATT CTCCCAGAAA CGCATCGTAG GGTACGTCTT 480 
== TACAAATACG GC ACGGAGAA ACCCCTAGGA TTCTACATCC GGG ATGGCTC CAGTGTCAGG 540 

. A\J GTAACACCAC ATGGCTTAGA AAAGGTTCCA GGGATCTTTA TATCCAGGCT TGTCCCAGGA 600 
= == : GGTCTGGCTC AAAGTACAGG ACTATTAGCT GTTAATGATG AAGTTTTAGA AGTTAATGGC 660 

Pi ATAGAAGTTT CAGGGAAGAG CCTTGATCAA GTAACAGACA TGATGATTGC AAATAGCCGT 720 

, ~ AACCTCATCA TAACAGTGAG ACCGGCAAAC CAGAGGAATA ATGTTGTGAG GAACAGTCGG 780 

U ; 0 ACTTCTGGCA GTTCCGGTCA GTCTACTGAT AACAGCCTTC TTGGCTACCC ACAGCAGATT 840 

ft= ; Z D GAACCAAGCT TTG AGCCAG A GGATGAAGAC AGCGAAGAAG ATGACATTAT CATTGAAGAC 900 
I:"! AATGGAGTGC CACAGCAGAT TCCAAAAGCT GTTCCTAATA CTGAGAGCCT GGAGTCATTA 960 

| y ACACAGATAG AGCTAAGCTT TGAGTCTGG A CAGAATGGCT TTATTCCCTC TAATGAAGTG 1 020 

t ; AGCTTAGCAG CCATAGCAAG CAGCTCAAAC ACGGAATTTG AAACACATGC TCCAGATCAA 1080 

AAACTCTTAG AAGAAGATGG AACAATCATA ACATTATGA 



r30 



Seq ID NO: 138 Protein sequence: 
Protein Accession #: XP 030559 



f^c 1 11 21 31 41 SI 

H35 I I I | | I 

fi ; MNRSHRHGAG SGCLGTMEVK SKFGAEFRRF SLERSKPGKF EEFYGLLQHV HKIPNVDVLV 60 

•-Z'_ GYADIHGDLL PINNDDNYHK AVSTANPLLR IFIQKKEEAD YSAFGTDTLI KKKNVLTNVL 120 

*i RPDNHRKKPH IVISMPQDFR PVSSIIDVDI LPETHRRVRL YKYGTEKPLG FYIRDGSSVR 180 

~* VTPHGLEKVP GIFISRLVPG GLAQSTGLLA VNDEVLEVNG IEVSGKSLDQ VTDMMIANSR 240 

«=*rU NLIITVRPAN QRNNVVRNSR TSGSSGQSTD NSLLGYPQQI EPSFEPEDED SEEDDIIIED 300 

W NGVPQQIPKA VPNTESLESL TQIELSFESG QNGFBPSNEV SLAAIASSSN TEFETHAPDO 360 
KLLEEDGTII TL 



45 



409 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
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